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Such as STAR plots, PROFLE plots, CODED SCATTER plots and CODED DRAFTSMAN plots. These 
techniques are examined and implemented in a series of computer programs which produces 
‘these graphical displays. A technical description of each computer program is presented 
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ABSTRACT 


This thesis presents several interactive computer programs for the analysis of 
multivariate data. A special case is that of panel data; multiple time series of short 
length. The first program, BOXPLOTAD, handles this type of multivariate data; it is 
an enhancement on an existing graphical technique for exploratory data analysis 
known as BOXPLOTS. The program works by appending boxplots as column dividers 
in a table of the raw data which originates the box plots. This combination of the raw 
data and the graphical representation of that data improves the understanding of the 
characteristics of the data in exploratory and descriptive applications; differencing and 
tracing of data through the table is also implemented. This thesis also presents and 
explores the use of other graphical techniques for exploratory data analysis of 
multivariate data such as STAR plots, PROFILE plots, CODED SCATTER plots and 
CODED DRAFTSMAN plots. These techniques are examined and implemented in a 
series of computer programs which produces these graphical displays. A technical 
description of each computer program is presented and user implementation procedures 
are discussed. The programs are implemented in APL and run in conjunction with the 
experimental IBM APL Graphics program GRAFSTAT. To demonstrate the use of 


these techniques, an analysis is conducted on several sets of multivariate data. 


THESIS DISCLAIMER 


The reader is cautioned that computer programs developed in this research may 
not have been exercised for all cases of interest. While every effort has been made, 
within the time available, to ensure that the programs are free of computational and 
logic errors, they cannot be considered validated. Any application of these programs 


without additional verification is at the risk of the user. 
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I. INTRODUCTION 


A. PREFACE 

One of the main problems in experimental statistics and experimental design is 
the exploratory analysis of raw data. This problem is greatly enlarged when the data 
presented to the statistician comes from unknown multiple populations or so called 
multivariate data. A special case is that of panel data; multiple time series of short 
length. The initial purpose of the data analysis is to try to capture the most important 
distributional characteristics of the data such as the range, location and spread of the 
data points. For the experimental statistician the main tool available for the analysis of 
the data is the graphical display of the marginal distributions of the data in order to 
visualize and gain better understanding of these characteristics and to compare them 
against those of the different populations. Following this, interactions or dependencies 


can be examined, and this is the domain of multivariate data analysis. 


B. PURPOSE 

The purpose of this thesis is twofold: first, to add to the tabular display of the 
original multivariate data an existing graphical technique known as the BOXPLOT (see 
[Ref. 1] ). This addition can be done in several alternating ways and is done in order to 
better understand the populations and the relations between the different populations. 
The second purpose of the thesis is to make available different computer programs to 
exploit several other enhanced statistical graphical techniques for multivariate data 
analysis. These techniques are: STAR plots, PROFILE plots, CODiD СА 
plots and CODED DRAFTSMAN plots. 


C. BACKGROUND 

Presently, the BOXPLOT technique is one of the most common graphical 
techniques used by data analysts, both outside and at the Naval Postgraduated School 
(NPS). There are different software packages that provide these plots, some of which 
are in the experimental IBM APL GRAFSTAT program and some in the IBM 
Mainframe NONIMSL library. 

One of the most important limitations of this graphical display technique 1s the 


absence of the raw data in the display; this absence is critical in the special case of 
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multiple box plots for the comparison of multiple sets of data. This would provide an 
inmediate identification of peculiar characteristics of the data, such as pin-pointing the 
outliers and the variability and/or relation of a sample datum with respect to other 
samples. Once the BOXPLOTS are displayed on the screen (or printed on a graph), the 
analyst has to go back to the original raw data in order to identify these data points. 
This limitation is overcome by the new technique presented herein, which 1s called the 
BOXPLOTTED TABLES. This technique has already been implemented, and can be 
used in the NPS IBM 3033 computer using an APL (A Programming Language ) 
program, which make use of the graphical capabilities of the IBM experimental 
GRAFSTAT software package. In GRAFSTAT, an interactive technique for 
identifying odd or outlying points is given. This implementation highlights the 
importance of a technique for data point identification. However, one does not always 
have access to such a program and the ability to do this identification on a printed 
page is important to a data analyst. The BOXPLOTTED TABLES do precisely this. 
Note too that a primary concern in multivariate data analysis is to get as much 
information on a two dimensional page as possible. Thus having tabular and 
distributional data together on one page, as in the BOXPLOTTED TABLES, 15 а step 
in this direction. 

There are other graphical techniques commonly used by data analysts such as : 
SCATTER plots, STAR plots, PROFILE plots, CODED SCATTER plots and 
CODED DRAFTSMAN plots (see [Ref. 1] ). These techniques are mainly used to 
enhance the interpretation and understanding of displayed multivariated data. Out of 
these, the DRAFTSMAN and SCATTER plot techniques (without coded symbols) are 
the only ones available at NPS up to this point. These other techniques are used to 
display the data points in many different forms, giving a new perspective to the 
interpretation of the original data. 

This thesis present a group of APL functions that will make possible the use of 
these graphical techniques to the experimental statistician at the NPS. Various 
examples that show how to use this software to analyze and graphically display sample 


data will be shown in the following chapters of this thesis. 


D. ORGANIZATION 
This thesis consists of three main blocks. The first one, Chapter Two, 15 dedicated 


to explain the technical aspects of these graphical techniques; the mathematical and 


ІМ 


statistical attributes of each technique are treated in this chapter. The second block is 
intended to introduce the user to the APL software code used to implement these 
techniques. In Chapter Three, both the user and system requirements are explained; 
and for the more technical oriented reader, a listing of the APL code is listed in 
Appendix A. In addition, several examples of program execution are listed in 
Appendix B. The last block, composed of Chapter Four and Appendix C, is dedicated 
to the exploratory analysis of several sets of sample data to demonstrate some of the 


potential applications of these graphical approaches to statistical analysis. 
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Il. GRAPHICAL TECHNIQUES 


A. BOXPLOTTED TABLES 
1. Overview 

The BOXPLOT graphical technique was first conceived by Tukey as a method 
to display an almost one-dimensional summary of the distribution characteristics of a 
set of data, Chambers [Ref. 1] provides an excellent analysis of this technique. This 
display shows some of the most prominent characteristics of the sample distribution 
such as the median, mean, the inter-quartile range and the outlicrs, if there are any. In 
the case where the sample comes from multivariate data, the BOXPLOT 1s used not 
only to show the individual characteristics of cach subsample, but, in addition, to 
compare the behavior of these characteristics with respect to other samples (see 
ШЕРІ р. $9] ). Figure 2.1 shows a BOXPLOT display. The BOXPLOT's almost 
one-dimensional character, as opposed to the two-dimensional character of the familiar 
histogram, facilitates comparison of the marginal properties of multivariate data sets. 

One of the limitations of the BOXPLOT is that of the identification of specific | 
values of interest such as outliers; 1f the identification of the outliers is the prominent 
feature, the statistician must make reference to the original data in order to identify 
which data point the outlier correspond to. 

A solution to this limitation, suggested by Professor P.A.W. Lewis in an 
unpublished work, is to show the original data and the BOXPLOT in the same 
graphical tabular display. In this case, the BOXPLOTS are shown as dividers of the 
Original tabulated data, so that aberrations are readily apparent and checkable (see 
Figure 2.2), This technique clearly requires the availability of high resolution graphics 
and a sophisticated plotting and data manipulation package. This requirement 1s met 
by the experimental APL GRAFSTAT program from IBM Research which 1s being 
used at the NPS on a test bed basis. 

2. Technical details of BOXPLOTTED tables 

In the BOXPLOT the top and bottom of the rectangle represent the upper 
and lower quartile of the data respectively. Therefore, the length of the rectangle 
represent the inter-quartile range ( Q(.75) - Q(.25) = IQR ), where Q(@), for O<a< I 


is the d-quantile of the sample. The mean of the data sample is shown by a sinall 
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Figure 2.1 BOXPLOT of California ы Data (Per Capita, 
Hospital Expenses, Years 1971-1975, in 14 Ilealth Service Areas). 
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circle, and the median by an asterisk inside the rectangle. The solid lines going out 
from the top and bottom of the rectangle represent the adjacent values. These values 
are defined as those data points greater or equal than Q(.75) and less or equal than 
Q(.75) + IQR for the upper line, and those values less or equal Q(.25) and greater or 
equal than Q(.25) - IQR for the bottom line. Those data points that fall in the range of 
еее, TOR), (Q(.25) - ТОК*1.5)] or КОС75) + ТОК), (О(.145) + IQR*1.5)] are 
called outliers and are represented by small light circles. The data points that fall 
beyond the ranges of these outhers are called extreme outlicrs and are represented by 
small black circles. As an example, for normally distributed data, approximately 5 
percent of the points should be outliers and marked with light circles and only about 
0.5 percent should be extreme outliers (see [Ref. 2] ). 

To obtain a BOXPLOTTED table, a tabular display of the data is added to 
the BOXPLOT display. At the bottom of each column the estimates for the mean, 
median, variance and the rank correlation between that column and the next column to 
the right are listed. The estimators for these parameters are defined as follows : 


.th th 


Let Xij ООСС ООС оаа оао ава letn be the number Of 


values in each column. Then 


Mean; = ^i = У; Xij / n. (2:315 
The Median is defined as follows. Let MID; be n/2 if n is even, and the largest 


integer smaller than n/2 if n is odd. Then 


Median, = X (MID), if n is odd, or (2.2) 


Median; = X (MID;) E X (MID; DES (2.3) 


h 


If n is even, where X j represents the i column sorted in descending order. The 


estimator for the variance 1s 


Variance = 9; ( Aij - X y /(n-1) (2.4) 


Finally, the Spearman’s p (RHO) Rank Correlation coefficient is defined as 
follows: let A; and Y; be two sets of data, and let [R(X:)] and [R(Y:)] be the ranks of 
A: and Y: as compared to the others X and Y values respectively, for 1— Tm 
R(Xi) — lif Xi is the smallest of X,,X,,...,.4,,, and R(X) = БАЕ БЕСЕ 
smallest, and so on, with rank n being assigned to the largest of the X;. The same 
applies for R(Y:). When assigning the ranks, if a tie is found (when two or more 
sample values are exactly equal to each other, they are tied), assign to each tied value 
the average of the ranks that would have been assigned if there had been no ties (see 


[ef З: р. 2521). Them the-estimarondas 


р; = MiRGY - (n D/2IR(S - (n 12] / (n(n? - 1))/12}, Q.5) 
If there are no ties in the data. If there are ties in the data, then the estimator is 


Ds УКСОКСО) - п( + 1)/2)* (2.6) 


[V ROS. - n(n-€ 0/2712. (YR? - n(n 1/2182 . 

Once the BOXPLOTTED tables for thc original data are obtained, it 1s then 
possible to join with lines, values with the same rank (order in magnitude) within their 
corresponding columns. The statistician may select to use this technique when rras 
desirable to study any possible relation with respect to time among variables (as in the 
case of multiple short time series), or with respect to magnitudes (as in the case of data 
with mixed qualitative and quanttative information). 

If the data are ordered (in descending order on the first column), then the 
outlier in the ftrst boxplot corresponds to the first value in the table. However, this 
ordering may be lost in the second column, so that it is not clear that an outlier in the 
second boxplot corresponds to the first value in the second column and so forth. Thus 
if a line is drawn linking the largest value in each column, two extreme results are 
informative. If the line is straight (or almost straight), it means that the outliers in 
successive columns come from the same source. If the line wanders, then there is no 
structural relationship along columns (or time, if one considers panel data). As an 
example, the study of health care expenses is a good prototype of multiple short time 


series analysis. 
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Figure 2.3. BOXPLOTTED Table with Joining Lines 
(Per Capita, California Hospital Expenses, Years 1971-1975). 
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In Figure 2.2 one could draw a line joining the highest or lowest expenses 
through time and see to which hospitals they belong. The reader could now make 
reference to Figure 2.3. In this figure one could see the transition of health care cost 
for the most and least expensive health service areas in California through the period 
of 1971 to 1975. A plot with the connections rs shown m Figure 4.2. 

In addition to this option, the user can display the differences between the 
values of the columns. These differences could be relative to the first column (base 
column) or with respect to the previous one. Whien the statistician is dealing with panel 
data, it is desirable to study the trend of relative (or absolute) rate of change in the 
data ports. Again, m the analysis of health care expenses, one may want to study the 
relative (or absolute) rate of change in this cost through a given period. In Figure 2.4, 
it 1s possible to infer that the relative change of health care expenses is not linear 
within the period of study. This inference would be enhanced Бу а plot of differences, 
as is done in Chapter Four. It is also possible to readily identify those health services 
areas that had the extremes rates of change. An analysis of this data 1s presented in 


Chapter Four. 


В ЭКЕЕ 
1. Overview 

In working with multivariate data, one of the key problems is how to 
represent more than two variables (dimensions) in a single display. There are several 
graphical approaches to deal with this problem, as mentioned in. Chambers [Ret. 1]. 
Four of these techniques are treated in this thesis : STAR (ог, ТКО ЕШ ШИ 
CODED SCATTER plots and CODED DRAF TSIEN plots. 

In the STAR plot each subpopulation ts displayed by a star m which each arc 
(or ray) represents a variable of interest. The value of each variable tis coded by the 
length of the corresponding arc; to avoid overlapping between arcs, these are portrayed 
symmetrically about the origm. This can be seen in Figure 2.5 and Figure 2.6, in which 
some characteristics of automobile data are displayed (a complete description of the 
data i$ presented in Chapter Four) In Figure 2.5, twelve variables ormi otes 
assigned to the rays of the star (ie. price, length, cte) In. Tigure 2*6 thes а 
representation is used to portray the same information but for several automobile 
subpopulations (models). It is now easy to graphically compare these characteristics 


(by the corresponding length of each ray) among the different subpopulations (models). 
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Figure 2.5 Assignment of Variables 
to the Rays of the STAR (Automobile Data). 
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Figure 2.6 STAR Plot of the Automobile Data. 
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2. Technical details of STAR plots 
There are two essential features in the construction of the STAR plot; the 
lengths of the rays and the angle between the rays. As stated in the last section, the 
value of the variables are represented by the length of the rays; therefore, these values 
should be non-negative and be represented using a similar scale. This is accomplished 


by initially rescaling the value of the variables using the following formula : 


Xy 7 ((1-c X; - min, ) / ( Max; - min; )] + с, (2.7) 


where c is a constant and is usually given a value of zero; and Ха represents the ith 


h 


observation of the j' variable. The coefficients min; and Max; represent the minimum 


) 
and maximum value of the jth variable respectively. Once the rescaling of the variables 
is performed, the angle between the rays must be determined. The first variable 
(variables are enumerated in increasing order) is plotted on the horizontal axis at an 


h 


angle of zero degrees. Then the jt? angle between the remaining variables is calculated 


using the following formula : 


o ue rne | (2.8) 


h variable. The 


where n represents the number of variables (parameters), and | 15 Ше i 
rays are then enumcerated from 2 to n and displayed counterclockwise. Finally, the szar 
is constructed by joining the end points of the n rays. The end point of each ray is 


calculated by the following formula : 


Р; = (Ху Reosm, Xj R біп оз), (2.9) 


th 


where] = 1,2,.., n^" variable, and R is the maximum allowable radius of the star. 


C.. -PROFILE ECOTS 
1. Overview 
The PROFILE technique is similar in nature to the STAR plot, the only 
difference is that in the PROFILE plot the rays are displayed by equidistant vertical 
lines arising from a common horizontal axis. In fact, as stated in Chambers [Ref. |: p. 


159], the STAR plots are actually PROFILE plots conceived in polar coordinates. In 
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the PROFILE plots, the values of the variables are used to control the length of the 
ends of the connected line segments (see Figure 2.7 and Figure 2.8). 

One of the possible advantages of the PROFILE plots over the STAR plots is 
that in the former it is possible to represent variables with negative values. Since in the 
PROFILE plots all value-vectors are displayed with respect to a horizontal axis, it 1s 
then easy to show variables with negative values. The base line, the horizontal axis, 15 
used to represent zero and negative values are displayed by lines dipping below this 
line. In the STAR plots this is not possible. 

2. Technical details of PROFILE plots 

In the PROFILE plot the rescaling 1s performed using the same formula as for 

the STAR plot. Negative values of the variables are allowed by using the following 


rescaling formula: 


и. 10 
Xy 7 (Xy / Max; ). (2.10) 


D. CODED SCATTER PLOTS 
1. Overview 
The CODED SCATTER plot is an enhancement of the most commonly used 
technique namely a SCATTER plot for two variables. Using this coding technique it 1s 
possible to represent more than two variable (dimensions) in the same display. 
Different symbols, and sizes and colors of these symbols, are used to represent three or 
higher dimensional data. The size and color of the symbols could be used to control 
different ranges of data values. 
2. Technical details of CODED SCATTER plot 
The CODED SCATTER plot uses essentially the same plotting technique as 
the usual SCATTER plot; only coded symbols, sizes and colors are added. This 1s in 
line with the need to represent as many dimensions as possible from a multivariate data 
set on a two dimensional graph. Thus, in a CODED SCATTER plot the position of 
Шөйроіп(5 іп the graphical plane are represented by the (X,Y) values of the two 
variables. Then, if X is the miles per gallon variable in a data set and Y is the price of 
the car, plotting Y vs X shows how gas consumption increases or decreases with 
Increasing cost of a car, or that there is a much more complex relationship between the 


two variables, or that there is even no relationship at all. However, there are other 
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Figure 2.7 
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Figure 2.8 PROFILE Plot of the Automobile Data. 
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variables or factors involved in the relationship. These may be either continuous, 
discrete or categorical factors. An cxample of the first is the weight of the car, an 
example of the second is the number of cylinders in the car. A categorical factor is 


Origin, 1.e. whether the car is domestically produced or not. 


USA = A, FOREIGN = F AND WEIGHT = SIZE OF LETTER 
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вы 29 CODED SCAT RFO othe Automobile Data. 
Price vs AEP O CIty). 

е shows a CODED SCATTER PLOW of the car price variable, X, 
versus the miles per gallon variable, Y. The best way to code the origin of the car is by 
using colors; however, due to reproduction problem, this has been encoded as symbol 
Ше 1 he weight of the car is coded as the size of the symbol. in Figure 2.9, one can 
see that increasing price gives lower m.p.g., although the relationship 15 far from lincar. 
The other factor is weight; weight clearly increases with price, also m.p.g decreases with 
weight. Again, referring to the categorical variable, origin, American cars cost more 
than foreign cars, weigh more and get less mileage, although there is interaction and 
overlap between all of these variables. There are also a few outliers. The very high 
mileage, low cost, and light weight car is the VW Rabbit (Dicsel) and the very heavy, 


low nuleage, and high cost car is the Cadillac Seville. 
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Е. CODED DRAFTSMAN PLOTS 
1. Overview 

The DRAFTSMAN plot is an arrangement of SCATTER plots in which any 
adjacent pair of plots have a common axis (see [Ref. 1: p. 145] ). In this way, the 
practitioner can observe the relationships of the variables within a specific plot and, in 
addition, can follow any particular observation (or group of observation) through the 
sequence of plots. Therefore, the analysis of multiple interactions among the variables 
is possible. This DRAFTSMAN plot can further be enhanced by portraying one or 
several additional variables by the assignment of symbols, sizes of the symbols, and 
colors to the already displayed variables. This is the main idea behind the CODED 
DRAFTSMAN plot, in which the techniques used in both DRAFTSMAN and 
CODED SCATTER plots are combined to display a single plot. 

2. Technical details of CODED DRAFTSMIAN plot 

The CODED DRAFTSMAN plot, as mentioned earlier, can be seen as a 
displayed array of several CODED SCATTER plots. In certain situations the 
SCATTER plots may be deceiving due to the overlapping of data points. This situation 
may require to jitter one or more variables in order to alleviate this problem. Also, the 
practitioner may want to transform the data in order to achieve a simple and more 
understandable picture and in this way facilitate the analysis of the relationships 
among the variables (sce [Ref. 4] ). Another technique used by statisticians to reduce 
the spread in the data, and to enhance the visual interpretation of the plots is to 
smooth the data , relying on the Moving Averages technique or the Locally Weighted 
Regression (LOW ESS) for this purpose. For further explanation about these two 
techniques see Moran [Ref. 5]. These techniques (jittcr, transformation, and smoothing), 
are included in the CODED DRAFTSMAN program presented in this thesis, and they 


can be invoked interactively. 
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IIl. COMPUTER PROGRAMS : USER INSTRUCTIONS AND 
TECHNICAL DESCRIPTION 


А. GENERAL 

This chapter provides detailed instruction on how to use the computer programs 
presented in this thesis. These programs were written in APL and are designed to be 
used in conjunction with the experimental IBM graphical software GRAFSTAT. All 
these programs are interactive and all user defined parameters and options selections 
are entered in response to program queries. Although no APL skills are required to 
operate these programs, it is recommended that the user becomes familiar with APL 
system commands and procedures to load and copy workspaces, groups and variables, 
and to understand the meaning of workspace, variables, groups and vectors in the APL 
terminology. The user should read VS APL AT NPS , [Ref. 6] before attempting to use 
these programs. For the experienced APL user it will be easy to make changes to these 
programs in order to accommodate any additional requirement. 

These programs were designed to be used on the IBM 3033 computer and to be 
executed using an IBM 3277/TEK 618, 3278/3279 or 3179G2 graphic display terminals 
using a memory capacity of at least 2 Megabytes. 

All of these programs are contained within an APL workspace called 
APLGRAFS, and are organized in different groups of functions (each group contain 
those functions related to a specific program application). The reader can find a list of 
Groups and the content of each group of functions in Appendix A2. In order to make 
use of these programs, the user must have access to this workspace and to the APL 
workspace called GRAFSTAT. 

There are two ways of executing these programs. The first one is described in the 
following steps : 

1) LOGON to the system. 
O Once in CMS, enter APLGST. 
ШЕ At the prompt CLEAR WS, type )LOAD ОКАЕЗ ТАГ. 


4) Enter ) PCOPY APLGRAFS groupname, where groupnaine is one of the groups 
listed in Appendix A2. 


5) Enter the name of the desired program to be executed (Nemes LAN LO leaned 


+ 


then answer the queries. 
The second mode is more user-friendly. The steps that must be followed are : 
1) LOGON to the system. 
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2) REGI S MEE APLGRAFS, (this will cause the execution of the macro 
APLGRAFS EXEC), then you will see a menu describing all the available 
programs (sce Figure 3,15): 


3) After you center the number corresponding to the selected program, you only 
have to follow the instructions given on the screen (this macro will execute 
steps 2), 3), and 4) of the previous list for you). 


FILE: MENU 


YOU HAVE THE FOLLOWING PROGRAMS TO USE 


(1) STAR AND PROFILE PLOTS 

(2) BOX PLOTTED TABLES 

(3) SYMBOLIC SCATTER PLOTS 

(4) DRAFTSMAN DISPLAY 

(5) LOWESS 

(6) EXPLANATION ON THESE FUNCTIONS 
(7) QUIT 


TYPE THE NUMSER CORRESPONDING TO THE PROGRAM YOU WANT 





Figure 3.1 Menu Presented by «РЕСЕ ЕЕ 


B. PROGRAM DESCRIPTION 

In order to use any of these programs, the user will need a matrix containing the 
sample data. This matrix could be in a CMS file, or could be a character array in an 
APL workspace. The programs will accept the data in either way; just follow the 
instructions given by the program as to the location of the data set. In addition, the 
user will need an APL two-dimensional character array containing the names of the 
variables which will appear in the display. These names are the labels which will be 
shown on the axis of the plots or in the rows and columns of the tables as in Figure 
2.2 and 2.4. If the user has not previously created this array, the programs will allow 
the user to enter the labels directly in response to a sequential series of queries. At this 
point, the user is ready to execute any of the programs. 

When answering the queries, if the user enter an erroncous response, the program 
will prompt the user to enter the correct response only in the case of a YES or NO 
question, a range question (i.c., 3,4, or 5 plots), if the name of any APL matrix does 
not exist in the workspace, etc.; in all other cases, the program does not faveuams 


means to know the validity of the response so the program) will accept any response as 
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a correct one. When using these programs, if the user wants to cancel the execution at 
any time during the execution, the user must hit the PA2 key. 
1. BOXPLOTTED TABLES 

This program is executed by entering the command BOXPLOTAB. Once this 
command is entered, the program will start running by prompting the user with a 
sequence of queries indicating the user to enter the input arrays and to select the 
different available options (see Appendix Bl, for an example of program execution). 

a. Input requirement. 


(1) The array containing the sample data, the array containing the names (labels) 
of the columns , and the array containing the names lee of the rows. 


(2) The title of the display. 
b. Options. 


(1) The data could be. displayed as originally entered or could be displayed 
ordered (sorted) by the first column. 


yee Once the BOXPLOTTED Tables are shown оп the screen , the user will be 
prompted as to whether or not he or she wants to join the data points of the 
Same position with hnes. The position is given by the order of the data points 
of the first column (see Figure 2.3). 

(3 After finishing with the previous displav, the user will be шше whether or 
noie Or she wants to see BOXPLOTTED TABLES oí the dillerences 
between columns; and if it 1s so, whether absolute or relative differences are 
desee hesdslerence could be calculated as follows : diflerence between all 
other columns with respect to the first one, or difference between adjacent 
columns (see Figure 2.4). 

2. STAR PLOTS and PROFILE PLOTS 

These two programs are executed by entering the command STARPLOT. The 
program will start running and the user will be asked to enter the desired function : (S) 
for STAR PLOT or (P) for PROFILE PLOT (see Appendix B2, and B3 for an example 
of the execution of this program). 

a. Input Requirements. 

(1) Same as for the BOXPLOTTED TABLES. 

b. Options. 

(1) Whether the whole original data is to be used or just a subsample of the data. 
Ihe subsample could be constructed by selecting specific columns and/or 
rows. 


(2) The user will be asked how many plots per screen are desired. This could be 
ор рег ѕсгеер. 


3. CODED SCATTER PLOT 
The execution and the input requirements of this program are similar to that 
of the STAR PLOT. To execute the program enter the command SCATPLOT (see 


Appendix B4 for an example of the execution of the program). 
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a. Options. 


The user will be asked to enter the title for the screen and the title for each 
РІО ОПИС Опсе 


For cach plot, the user must enter the column to be used on the X and Y axis, 
and whether or not the entire data or a subsampie of it 1s desired. 


Another option is to select whether the data is to be jittered or if a 
transformation of the data (specified by the user) is desired. 


Following this option, the user must select the position of each plot on the 
screen 1. 21,220 во 

Finally, the user must specify the symbols, colors and sizes of these symbols 
that will be used to FCRI E an specific subset of the data. [f the user selected 
one plot per screen, the program will ask for a small description for each one 
of these subsets or categories. These subsets are defined Using 218 
statements. The user can specify more than one subset in the same plot and 
more than one plot per screen (see Figure 2.9). 


. CODED DRAFTSMAN PLOTS 


This program is executed by entering the command DRAFTSMAN. Once this 


command is entered, the program will start by prompting the user with a sequence of 


queries indicating the user to enter the input arrays and to select the different available 


options (see Appendix B5, for an example of program execution). 


(1) 


(1) 
(2) 


а. Input requirement. 


The array containing the sample data, and the array containing the names 
(labels) of the columns. 


b. Options. 


The data could be used as originally, entered, or could be Jittered or 
transformed. Vhe user must select the desired option. 


Select whether or not a smoothed curve will be fitted to the data in all plots 
on the screen. If the smoothed curve is selected, the user must indicate 
whether the Moving Average or LOW ESS technique will be used. 


Select_ between using the CODED DRAFTSMAN ог the regular 
DRAFTSMAN plot. IT the former is selected, the User must enter an ie 
expression, a symbol and its size, and the color for each category to be 
represented. 


Select the number of Зее desired per screen (the available options are 3,4 or 
5 rows and columns of plots per screen). 


Once the display is shown on the screen,and if the answer to option (2) was 


no, the user now has the alternative of fitting a smooth curve to the data of 
an specific plot. 
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IV. DATA ANALYSIS 


A. GENERAL 

The primary purpose of this chapter is to demonstrate the applications of the six 
graphical techniques presented in this thesis, namely BOXPLOTTED tables, STAR 
plots, PROFILE plots, CODED SCATTER plots and CODED DRAFTSMAN plots 
in the analysis of multivariate data. An attempt is made to highlight different 
peculiarities on the sample data that could be found when the practitioner uses these 
techniques; therefore, a full analysis of the various samples 1s not envisioned. However, 
it will be seen, that in using this techniques one can draw solid conclusions about 
certain behavior of the characteristics of the population from which the sample 1s 


drawn. 


В. AN ANALYSIS OF HEALTH CARE EXPENSES 

The following type of data represent a good example for which the statistician 
can make use of the BOXPLOTTED tables and the PROFILE plots. This is a sample 
of panel data and represents the health care cost (per capita hospital expenses) of 14 
health service areas through the State of California from the years of 1971 to 1975. 
Figure 4.1 is a BOXPLOTTED table which displays the average health care expenses 
of the areas. 

The data was formatted as a two dimensional array of 14 rows and 5 columns. 
Each row of the array represents the average health care expenses of a given area, and 
each column corresponds to the average expense for a given year. The data have been 
ordered in decreasing order by the first column (year), i.e., the service area with higher 
health care expenses on the first year correspond to the first row and so on. In general, 
the boxplots in Figure 4.1 give an initial impression of the distribution of each 
subsample data. Notice that during the first three years the tendency of the distribution 
is definitely skewed to the right, caused by some possible outliers, indicating that some 
service areas far exceed the average health care expenses. However, in the last two 
years the tendency is the opposite, with again the exception of some possible outliers. 

The initial impression given by the boxplots could further be exploited by an 
analysis of the flow of the data in order to study the trend of the mean health care 


Во спе Variance of health care cost, and to identify the occurrence, or recurrence, of 
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possible outliers. Another possible trend that could be study is that of the relative 
difference in health care expenses. One further arca of interest is to study the trend of 
the most and less expensives service areas through this period. 

The tendency in the average health care cost during this period was expected to 
be an increasing one (during this time, among other things, the inflation rate was 
starting to increase very rapidly). This tendency can be seen in Figure 4.1. Notice that 
this change is apparently quite linear up to the year of 1974; in 1975 there 1s a big 
jump in the average cost which probably indicates that, overall, the trend in the 
average health expense through this period was not linear. This same tendency is 
present in the variance of health care cost, which seems to confirm the nonlinearity in 
the average health care expense during this period. In this figure and in Figure 4.2, 
where lines are used to trace the flow of the 1971 high and low cost areas through 
subsequent years, it is also possible to readily pinpoint those service arcas of extreme 
average health care cost (possible outliers, as defined in Chapter II). Notice that, the 
health service area number 4 is shown as a possible outlier through all years; it 1s 
always at least 2.20 from the mean cost. The service arca number 3 has the same 
tendency. These two areas are then the possible cause in the high variation observed in 
the health care cost through this period. They actually represent the Los Angeles and 
San Francisco metropolitan areas. In Figure 4.2, one could also follow those service 
arcas with lower average cost (these areas are joined by line segments at the bottom of 
the display), it seems that these areas (number | and 14) had the lowest cost through 
this period, with the exception of service area no. 8 which has the lowest cost in 1971. 

One could further follow the trend in the change of health care expense for each 
respective service area by using the PROFILE plots. In Figure 4.4 each one of the 
profile plots portrays the values of cach row (health care service area), and the values 
are ordered by the magnitude of the first column, as in the BOXPLOTTED tables. The 
values of each column are represented in each profile according to the assignment 
given in Figure 4.3. In Figure 4.4 it is quite casy to identify the health service areas 
that had the highest and lower health care expenses during this period; as 1t was seen in 
the BOXPLOTTED tables, these areas are number 4d and 3, and number 1 апа 14 
respectively. One could also readly pinpoint the area with more variability in health 
care expenses, in this case notice that area number 13 has greater change in health 
expenditure than areas number 1, 14 and 4 (this last being the most expensive). Notice 
that the highest variation in expenditure in area number 13 takes place during 1972 and 


1973. This fact could also be capture in Figure 4.5 in columns 2 and 3. 
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Figure 4.3 Assignment of Variables to the Profile of the 
Per Capita "Hean" Care Expenses: 





Figure 4.4 Profile Plot of the Per Capita Health Care Expenses 
California Efealth Service Areas. 
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Now one could also reinforce the statement about the nonlinearity in the change 
of the health expenditure by looking at the relative difference in this variable through 
this perrod. Figure 4.5 portrays the trend in the relative difference in health expenditure 
with respect to the first year of study (1971). In Figure 4.6 one could see the some 
trend but now with respect to the previous year. Figure 4.6 definitely shows that the 
change in health care expenses has a nonlinear behavior. It is changing linearly during 
the first three years, and then at an accelerated, possible quadratic rate, from then on. 
The same trend seems to be shown in Figure 4.5, this trend is highlighted in the last 
column, where the mean of the relative differences jump from 48.91 to 76.50. 

In Figures 4.5 and 4.6 it 1s also possible to identify those services areas that have 
the maximum and minimum relative change. As an example from 1971 to 1972 area 
number 8 had the maximum positive increase and from 1972 to 1973 the area with the 


maximum positive change was area number 4. 


С. AN ANALYSIS OF THE NEW YORK STOCK EXCHANGE 
In the previous analysis, the data considered consisted of the same type of 

commensurable values; 1.e., dollars through a period of time (one could consider this as 
being multiple short trme series data). In contrast with this type of data, the 
practitioner can encounter multivariate data that represent different qualitative and 
quantitative magnitudes. One example of this type 1s the data obtained from the stock 
markets in the United States. Here again, the practitioner can make use of the 
BOXPLOTTED tables as a tool for data analysis. The data to be analyzed was 
extracted from the New York Times, representing the most active stocks (measured by 
the number of shares traded) in the New York Stock Exchange for the week ended on 
August 8, 1986. The data is initially formatted as a two dimensional array consisting of 
40 rows (representing each of the different trading companies) and 6 columns. Each 
column correspond to the following variables. 

(1) Volume of shares traded during the week (in 100,000 units). 

(2) Closing price at the end of the week (in dollars). 

(3) Price change during the week {in percentage). 

(4) Price change during the last 12 months (in percentage). 

(5) Earnings per share during the last 12 months (in dollars). 


(6) Earnings per share during the last 12 months (in percentage). 
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California Health Service Areas (First Year). 
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Figure 4.7 shows the initial distributional characteristics (in the form of boxplots) 
of each subsample data. One of the first visual messages from these plots are the 
outliers in cach column. Here is where the power of this new graphical technique lics: 
onc can easily identify those possible outliers by looking at the tabular data adjacent to 
the boxplots; although this is easicr in the first column since the data is ordered in that 
column. As an example, looking at the first boxplot and the first column, it 1s easy to 
identify Owen Corning and the Mobil Corp. as those companies traded by these two 


companies is greater that 2.70 of thc average column traded. 
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Figure 4.7 40 Most Active Stocks for the Weck Ended August 8, 1986 
(New York Exchange). 

Another observation that can be made from this figure is the absence of 
statistical corrclation among the variables when these are compared in the order shown 
in Figure 4.7. In this case, the sample serial rank correlation are obtained by 
comparing the adjacent columns. Looking at the sample serial rank correlation, onc 
could conclude that there is no statistical relationship between, as an example, the 


volume of shares traded and the price at which the share closed at the end of the weck. 
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Figure 4.8. 40 Most Active Stocks for the Week Ended August 8,1986 

(New York Exchange) with Lines Connecting the Two Higher Stocks. 
The only positive indication is a relationship between percentage change during the last 
week (column 4) and percentage change in the last year (column 5). One can visually 
confirm this lack of correlation by identifying the maximum and minimum values of 
adjacent columns. For example, the Am Motor Co. shows to have the lowest volume 
of shares traded during that week but the LTV Corp. had the lowest close price. 

Notice that one is not only interested in the stock which is the most active during 
the week. One is also interested in which stock has the greatest (absolute or relative) 
change in price, and whether this is related to other factors like earnings (absolute or 
relative). With this 1s inind, it is possible to follow those stocks that have the largest 
value in each of the variables considered tn the analysis. Figure 4.8, shows the two 
stocks which have this characteristic. These stocks are joined by line segments. [t is 
easy to see that the Owens Corning Co. is the stock with the highest volume of shares 
traded during the week and also the largest price change (in percentage) during the last 
twelve months; also, the IBM Co. has the highest closed price at the end of the week 


and has the second largest earnings per share (in dollars) during the last year. Likewise, 
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the practitioner could follow the trend of those stocks with the lowest value in each of 
the variables considered, or even the mid-point values. 
Differences have no meaning here but it is interesting to trace the movement of 


the most active stocks (in volume) to other indicators (columns). 


О. AN ANALYSIS OF AUTOMOBILE DATA 

The purpose of the following analysis is to try to explore some important 
descriptive characteristics of different types of automobiles and an attempt is made to 
find any relation between these characteristics. As it is shown in this analysis, the 
SIARPLOTS, the CODED SCATTER plots and th CODED DRAFTSMAN plots 
techniques are paramount experimental statistical tools in this type of analysis. It is 
appropriate to mention at this time that one other author has previously made use of 
the data treated here and has written an outstanding analysis (See [Ref. 4] ). The 
purpose here is to demonstrate how one can convey to the same general conclusion 
using these new techniques. The new technique is the enhancement of SCATTER and 
DRAFTSMAN plots by coding in other variables. 

The data represent three general categories of quantitative and qualitative 
characteristics of American and Foreign automobiles of 1979 (the data was obtained 
from the Consumer Report Review). These categories are: performance, dimension and 
price. The variables under these categories are as follows. 

In category one : mileage in miles per gallon, repair records for 1977 and 1978 
(rated on a 5 points scale; S= best and 1= worst), turning diameter (clearance required 
to make a U-turn) in feet, gear ratio for high gear. 

In the second category : headroom in inches, weight in pounds, length in inches, 
displacement in cubic inches. 

And under the last category: price in dollars. This data was initially formatted 
into a two dimensional array consisting of 74 rows (name of automobiles) and 13 
columns. Each column corresponds to each one of the variables mentioned above, and 
the last column correspond to an ordinal variable to denote American or Foreign car. 
This variable has been added to the original data to demonstrate one of the many 
Bessivole application of the CODED SCATTER plot and of the CODED 
DRAFTSMAN plot introduced in this thesis; as an example, one can readily identify if 


a certain deviation from a possible pattern is due to American or Foreign cars. 
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Figure 4.9 Assignment of Variables to the Rays of the Star 
Automobile Data. 
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Figure 4.10 STAR Plot of Automobile Data, 10 Lighter and 
the 10 Heavier Automobiies. 
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As an initial starting point for this analysis, one could study the characteristics of 
each one of the individual automobiles. The STAR plot technique was choosed for this 
purpose. Figure 4.9 shows the assignment of the twelve characteristics to the rays of 
the star. In the study of this type of data, it is interesting to highlight the favorable 
characteristics of each automobile. So as in Chambers [Ref. 1] the larger the ray of the 
star the more positive that attribute is to the respective automobile. To make price, 
turning diameter and gear ratio favorable, these variables were multiplied bv -1 (1.e., 
the larger the ray corresponding to price, the less expensive the car is). The star 1s 
arranged in such a way that the statistics corresponding to cost and performance 
categories are rising upward and horizontally, and those rays pointing downwards 
correspond to variables closely related to the dimension of the automobiles. Appendix 
C shows the complete STAR Plots for the 74 automobiles. 

The array of stars are ordered by the weight, the first and last stars 
corresponding to the lightest and heaviest automobiles respectively. Figure 4.10 
displays a summary of Appendix C, showing the 10 heaviest and 10 lightest 
automobiles. The idea behind this arrangement is, as commonly accepted, that weight 
15 positively correlated to safety. Note the switch between the first (IIonda Civic) and 
last (Lincoln Continental). For the first of these all positive values are above the line; 
for the latter this is switched. Note too that the variable of greatest interest to 
Consumers Reports, Repair 78, is the vertical ray. 

ioericure 4.10) it is easy to see that nine out of the ten lighter cars, in the top 
panel, are of foreign make, the exception being the Ford Fiesta. Also, that the 10 
heavier cars are Americans. From the STAR plot of Figure 4.10, one could also 
compare other characteristics among these automobiles. As an example, in terms of 
price variable alone, notice where in this case, that among the 10 heavier cars there are 
4 American cars that are inexpensive compared with most of the lighter foreign cars 
(these American cars are the Mercury Cougar and Cougar XR-7, Buick Electra and, in 
lesser way the Oldsmobile 98). Also in terms of repair records (of 1977 and 1978), those 
American cars among the heavier ones compare with those foreign cars among the 
Other group. The information is abundant in these plots. However, when there are 
many variables involved in the analysis it is questionable whether the practitioner can 
actually capture the behavior of one variable alone or the joint bchavior of two or 
more variables. As in this case one would like to see if there is any relation (linear or 


Other type) between price and weight or, say, displacement and price (it is difficult to 
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identity any deviation, if it exists, from a possible relation in the STAR plot). In this 
type of situation, once the practitioner has an initial impression of the data, it 1s now 
the time to make use of other exploratory data analysis technique, such as CODED 
SCATTER plots and CODED DRAFTSMAN plots. 

To continue the analysis it was desired to study any possible rclation among the 
price, mileage per gallon, weight and displacement of the automobiles and to compare 
how American cars stand against the Foreign cars. The relations between these 
variables are examined in Figure 4.11 by using a CODED DRAFTSMAN plot. It was 
expected to see positive correlation between displacement and weight. This can easily 
be secn in the plot position 2,2 of Figure 4.11. The two possible outliers in plot 2,2 of 
Figure 4.11 show that there are two American cars that stand favorably among all 
others. They are lighter cars with high displacement. From the figures in Appendix C 
these two automobiles were identified as the Chevrolet Chevette and the Buick Opel. In 
terms of price, it 1s also possible to conclude from plot position 3,5 of Figure 4.11 that 
there is a negative relation, as expected, between price and weight. Notice, in the plot 
position 2,1 that there scems to be two types of subsamples within the data, one of 
foreign cars and the other of American cars (the foreign cars standing favorably against 
the American ones); however, both subpopulations have the same trend, namely, that 
weight increases with price. There are a couple of interpretations of this plot, beside the 
obvious dichotomy between American and Foreign autos. One is that if you want a 
heavy car, you will have to pay more if you also want it foreign made. 

An expansion of Figure 4.11 is given by the CODED SCATTER plot, wien 
have additional variables coded in as symbol type and size. Looking at price versus 
m.p.g., in the CODED SCATTER plot of Figure 4.12, one can confirnythemedeasiaam 
the higher the price of the automobile the less miles per gallon is expected. Notice, that 
with this figure, it is possible to analyze four variables at the same time : price and 
miles per gallon being the axis and weight and nationality the coded variables. It is 
interesting to notice that one American and one Foreign car tend to deviate from the 
norm. The American cars is the Cadillac Seville, with very high price, quite heavy, but 
a good relative mileage; the Foreign, being the V.W. Rabbit (Diesel) is at the opposite 
site of the spectrum. In the middle of the plot is a medium price, foreign car with very 
good mileage. This is a BMW 3201. 
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Figure 4.11 SOR Oe DRAFTSMAN Plot, of Automobile data 
American, F = Foreign. 
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Figure 4.12 CODED SCATTER Plot of Automobile Data 
Price vs MPG (A = American, F= Foreign and Size = Weight). 
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Е. AN ANALYSIS OF CONTRACT DATA 

The purpose of this analysis is to demonstrate other possible applications of the 
CODED SCATTER plot graphical technique as a toot in the exploratory data analysis. 
It is also appropriate to emphasize that the data considered in this section has been 
amply analyzed by other authors ( [Ref. 4] ) and again the purpose is only to highlight 
the use of this mentioned graphical technique. 

The data consisted of 177 contracts (rows), which were authorized by the 
Department of Defense during the period of 1949 through 1963. The cotumns consist 
of 11 variables of possible interest to the Department of Defense on how they have 
interfaced on a contractual level with the private sector of manufacturers. The data 
represent contracts let with 23 major contractors during this period, and includes 
information concerning 7 types of manufacturer products, ranging in complexity from 
drone aircrafts to missiles and helicopters. The 11 variables are listed below: 

(1) Deviation from target cost (percent). 
(2) Months to comply a contract. 
(3) Target profit of manufacturer (percent). 
(4) Sharing ratio (percent). 
(5) Ceiling price (percent of target price). 
(6) Target cost. 
(7) Number of items produced in the contract. 
(8) Number of contracts let that year. 
(9) Year the contract was signed. 

(10) Contractor awarded the contract. 

(11) Type of system. 

Due to the diversity of the data and the purpose of this section, it was decided to 
narrow the objective of the analysis to a single issue, which is probably the most 
important to the Department of Defense: an attempt will be made to see if there is any 
increase (or decrease) in the deviation from the manufacturer target cost through time. 
The one deviation that 1s considered to be the most significant will be the positive one, 
since this phenomenum would represent additional expenditure to the government. 
Thus, the task 1s to try to find a possible cause to this increase. 

Among the other 10 factors, it was hypothesized that the year in which the 
contract was signed and the time (in months) to complete the contract had significant 


influence on the deviation from the manufacturer original target cost. The variable year 
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signed was considered since the period of study includes an event that had significant 
impact on the US economy: the Korean War; therefore, it was expected that smaller 
contractors, not really prepared to react to the contingency of war production, would 
be less capable of making accurate predictions. Figure 4.13 shows the display of the 
year the contract was signed versus the deviation from target cost. It was also 
expected that the contractor, increasing from normal productions, would also be 
affected in their prediction capabilities. The hypothesis about the time to complete the 
contract is based on a simple idea : the wider the interval of time for which the 
prediction is made, the less is the probability of asserting the prediction. It was also 
desired to sce if the major trend in this deviation of the major contracts, since these 
were probably of greatest interest to the government. [n Figures 4.13 and 4.14 three 
major contractors were selected as been of relative importance: Lockeed, Douglas and 
Grumman. These three are coded by the initial letter. lt is casy to see that the actual 
year that the contract was signed does not really influence the deviation from target 
cost; the deviation are evenly distributed across the period of interest. However, notice 
that during 1951 and 1952 (Korean War period) the deviation are mainly on the 
negative side (probably the significance of patriotism) and thereafter are evenly 
distributed. | 
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Figure 4.13. Year Signed vs Dev. From Target Cost, Contract 
Data (L= Lockeed, G= Grumman, D = Douglas, o = Others). 


The other variable of interest was then considered, namely the time to complcte 


the contract. The range of this variable is from around 15 months to 130 months. 
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Figure 4.14 Months to Complete vs Dev. Target Cost, Contract 

Data (L7 Lockeed, G7 Grumman, D= Douglas, o = Others). 
ligure 4.14 shows the plot of those contracts that took less than 40 months, 40 months 
and less than 70, and 70 or more months versus cost deviation respectively. It is clear 
that there is some form of positive relation between cost deviation and those contracts 
that took more than 70 months to be completed; confirming the initial hypothesis. 
There are some exception to this conclusion, and these are mainly contracts that were 
given to two of the largest contractors, Lockeed and Grumman, and possibly two 
smaller ones. 

These plots, in a clear way, demonstrate the care that must be taken in the 
analysis of single scatter plots, one scatter plot portrays only isolated relationship of 
two variables and may not indicate a casual relationship. One should make use of 
different exploratory data analysis techniques in an attempt to discover possible trends 


in the data being analyzed. 


APPENDIX A 
COMPUTER PROGRAMS 


1. APLGRAFS EXEC. 
This exec program present a menu with all the programs available in the APL 
workspace APLGRAFS, after the selection 1s made the exec will load the necesary 


workspaces and will prompt the user to enter the name of the selected program. 


&TRACE 

SET BLIP ж 

-ОМЕ 

CLRSCRN 

&TRACE 

&TYPE 

&TYPE YOU HAVE THE FOLLOWING PROGRAMS TO USE 
&TYPE 

&TYPE (1) STAR AND PROFILE PLOTS 
&TYPE (2) BOX PLOTTED TABLES 

&TYPE (3) SYMBOLIC SCATTER PLOTS 
&TYPE (4) DRAFTSMAN DISPLAY 

&TYPE (5) LOWESS 

&TYPE (6) EXPLANATION ON THESE FUNCTIONS 
&TYPE (7) QUIT 

&TYPE 


&TYPE TYPE THE NUMBER CORRESPONDING TO THE PROGRAM YOU WANT 
&READ VAR &OPT 

&IF &OPT = 7 &GOTO -FINAL 

&IF ФОРТ < 1 &GOTO -ERRORI 

&IF &OPT > 6 &GOTO -ERRORI 

ж СР DEFINE STORAGE 2048K 

ж &STACK I CMS 

CP TERMINAL APL ON 

&STACK JLOAD GRAFSTAT 


&IF &OPT = 2 &GOTO -TWO 
&IF &OPT - 3 &GOTO -THREE 
&IF &OPT - 4 &GOTO -FOUR 
ЗІҒ ФОРТ - Б &GOTO -FIVE 
ТР ФОРТ = 6 &GOTO ~SIX 


&STACK ‘NOW LOADING , DONT TOUCH YOUR KEYBOARD' 

&STACK JPCOPY APLGRAFS GSTARPLOT GDEMO 

&STACK JPCOPY 990 CMSIO 

&STACK ' ' 

&STACK 'FOR A DESCRIPTION OF THESE FUNCTIONS TYPE : ' 
&STACK ' ' 

&STACK 
&STACK 


я INSTRUCTIONIONS ' 

а 
&STACK ' ' 

1 

4 


&STACK 
&STACK 
&STACK 
&STACK ' ' 
&GOTO -SEVEN 
-TWO &IF &OPT > 2 &GOTO -THREE 
&STACK "МОН LOADING , DONT TOUCH YOUR KEYBOARD ' 
&STACK JCOPY APLGRAFS GBOXPLOTAB GDEMO 
&STACK JPCOPY 990 CMSIO 

1 1 


TO EXECUTE THE FUNCTION STARPLOT TYPE : ' 


STARPLOT ы 


&STACK 

&5ТАСК 'FOR A DESCRIPTION OF THESE FUNCTIONS TYPE : ' 
&STACK ' ' 

&STACK ' INSTRUCTIONIONS ' 
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&STACK 
&ST ACK 
&STACK 
&STACK 
&STACK 
&STACK 
&GOTO 


&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&GOTO 


-FOUR &IF $&OPT > 4 &GOTO 


&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&GOTO 


-ҒІУЕ &IF &OPT » 5 &GOTO 


&STACK 
&STACK 
&STACK 
&ST ACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&STACK 
&GOTO 


' TO EXECUTE THE FUNCTION BOXPLOTAB TYPE 
1 


BOXPLOTAB ' 


-SEVEN 
-THREE &IF &OPT » 3 &GOTO 


-FOUR 

"NOW LOADING , DONT TOUCH YOUR KEYBOARD' 
УРСОРҮ APLGRAFS GSCATPLOT GDEMO 

УРСОРҮ 990 СМ510 


в ! 
"FOR A DESCRIPTION OF THESE FUNCTIONS TYPE 
И INSTRUCTIONIONS ' 
' TO EXECUTE THE FUNCTION SCATPLOT TYPE 
! SCATPLOT : 

-SEVEN 


-FIVE 


"NOW LOADING , DONT TOUCH YOUR KEYBOARD' 
JPCOPY APLGRAFS GDRAFTSMAN GDEMO 
JPCOPY 990 CMSIO 
"FOR A DESCRIPTION OF THESE FUNCTIONS TYPE 
! 3 
а INSTRUCTIONIONS ' 
' TO EXECUTE THE FUNCTION DRAFTSMAN TYPE 
! 8 
' DRAFTSMAN " 
0 8 
-SEVEN 


-SIX 

"МОЯ LOADING , DONT TOUCH YOUR KEYBOARD' 
JPCOPY APLGRAFS GLOWESS GDEMO 

JPCOPY 990 CMSIO 


"FOR A DESCRIPTION OF THESE FUNCTIONS TYPE 


INSTRUCTIONIONS ' 


' TO EXECUTE THE FUNCTION LOWESS TYPE 


LOWESS " 


~SEVEN 


-SIX CLRSCRN 
&CONTROL OFF 
&BEGTYPE -PAPA 


THIS WORKSPACE CONTAINS PROGRAMS THAT MAY BE USED AS EXPLORATORY 
DATA ANALYSIS TOOLS. PROGRAMS THAT ARE USED TOGETHER ARE 
CONTAINED IN GROUPS. 


THE GROUPS CURRENTLY AVAILABLE ARE GDEMO,GSCATPLOT „СВОХРГОТАВ ; 
GSTARPLOT, GDRAFTSMAN AND GLOVIESS, WHERE THE G STANDS FOR GROUP. 


IF YOU HAVE COPIED THE WHOLE WORKSPACE APLGRAFS YOU CAN SEE A 
LIST OF THESE GROUPS AT ANY TIME BY DROPPING INTO APL AND TYPING : 


YGRPS . 
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GROUPS : 


GDEMÓ. пози ra 


GSCATPLOT...... 


GBOXPLOTAB. 


GSTARPLOT..... 


THIS GROUP CONTAIN SOME DATA SETS TO BE USED 


FOR ILLUSTRATION BY THE PROGRAMS IN THIS WS. 


THIS GROUP CONTAIN ALL OF THE PROGRAMS REQUIRED 


TO PRODUCE SYMBOLIC SCATTER PLOT OF THO OR MORE 
DIMENSIONAL DATA. A BASIC DISCUSSION OF THESE 
DISPLAYS IS CONTAINED IN ‘GRAPHICAL METHODS FOR 
DATA ANALYSIS‘ BY CHAMBERS (PAGE 157) 

TO EXECUTE THIS PROGRAM TYPE : 


SCATPLOT 


AND THEN ANSWER THE QUESTIONS. 

YOU WOULD NEED THE FOLLOWING TWO DIMENSIONAL 

ARRAY : . 

- ARRAY OF DATA ( IN APL INSIDE THE 
WS OR OUTSIDE AS A FORTRAN FILE) 


FOR A DEMO USE THE FOLLOWING ARRAY : 
DATA -------- > CALHOS 
CALHOS CONSISTS OF COST PER PATIENT IN 14 GEO- 


GRAPHICAL DISTRICTS (ROWS) OF CALIFORNIA OVER 5 
YEARS (COLUMNS). 


.... THIS GROUP CONTAINS ALL OF THE PROGRAMS REQUIRED 


TO PRODUCE BOX PLOTTED TABLES (A COMBINATION OF 
BOX PLOTS AND A TABLE WITH THE ORIGINAL DATA ON 
THE SAME DISPLAY).TO EXECUTE THIS PROGRAM TYPE 


BOXPLOTAB 


AND THEN ANSWER THE QUESTIONS. 
YOU WOULD NEED THE FOLLOWING TWO DIMENSIONAL 
ARRAYS : 
- ARRAY OF DATA ( IN APL INSIDE THE 
WS OR OUTSIDE AS A FORTRAN FILE ). 
- ARRAY OF NAMES OF COLUMNS (AN ARRAY OF 
DIMENSION [NCOL;,201) 
- ARRAY OF NAMES OF ROWS (AN ARRAY OF 
DIMENSION [NROW,201) 


IF YOU DONT HAVE THE ARRAYS OF NAMES THE PROGRAM 
WILL ASK YOU TO ENTER THE NAMES ONE BY ONE. 


FOR A DEMO USE THE FOLLOWING ARRAYS : 
DATA -------- > CALHOS 


RON NAMES ---» CALHOSR 
COL NAMES ---> CALHOSC 


. THIS GROUP CONTAINS ALL OF THE PROGRAMS REQUIRED 


TO PRODUCE STAR AND PROFILE PLOTS OF THO OR MORE 
DIMENSICNAL DATA. A BASIC DISCUSSION OF THESE 
DISPLAYS IS CONTAINED IN ‘GRAPHICAL METHODS FOR 
DATA ANALYSIS' BY CHAMBERS (PAGES 158-1623) 

TO EXECUTE THIS PROGRAM TYPE 


STARPLOT 
AND THEN ANSWER THE QUESTIONS. 
YOU WOULD NEED THE FOLLOWING TWO DIMENSIONAL 


ARRAYS : 
- ARRAY OF DATA ( IN APL INSIDE THE 


47 


WS OR OUTSIDE AS A FORTRAN FILE) 

- ARRAY OF NAMES OF COLUMNS (AN ARRAY OF 
DIMENSION [NCOL,;201) 

~ ARRAY OF NAMES OF ROWS (AN ARRAY OF 
DIMENSION (МЕОИ,201) 


IF YOU DONT HAVE THE ARRAYS OF NAMES THE PROGRAM 
WILL ASK YOU TO ENTER THE NAMES ONE BY ONE. 


FOR A DEMO USE THE FOLLOWING ARRAYS : 


DATA -------- > CARS 
“ ROM NAMES ---» CARSR 
COL NAMES ---> CARSC 


CARS IS THE CAR REPAIR DATA GIVEN BY CHAMBERS, 
ET D. 


GDRAFTSMAN ..... THIS GROUPS CONTAINS ALL OF THE PROGRAMS REQUIRED 
TO PRODUCE DRAFTSMAN DISPLAYS OF THO OR THREE 
DIMENSIONAL DATA. A BASIC DISCUSSION OF THESE 
DISPLAYS IS CONTAINED IN 'GRAPHICAL METHODS FOR 
DATA ANALYSIS' BY CHAMBERS (PAGES 136-140) 

DETAILED EXPLANATIONS OF THESE PROGRAMS ARE 
CONTAINED IN ‘DRAFTSMAN DISPLAY ; А GRAPHICAL 
EXPLORATORY DATA ANALYSIS TECHNIQUE' AN NPS THESIS 
BY CAPT. MALCOLM JOHNSON, USA. 

THESE PROGRAMS ARE COMPLETELY INTERACTIVE AND CAN 
BE INITIATED BY TYPING : 


DRAFTSMAN 


AND THEN ANSWER THE QUESTIONS. 
YOU WOULD NEED THE FOLLOWING TWO DIMENSIONAL 
ARRAYS : 
- ARRAY OF DATA ( IN APL INSIDE THE 
WS OR OUTSIDE AS A FORTRAN FILE) 
- ARRAY OF NAMES OF COLUMNS (AN ARRAY OF 
DIMENSION [NCOL;,201) 
- ARRAY OF NAMES OF ROWS (AN ARRAY OF 
DIMENSION [NROW,20)) 
IF YOU DONT HAVE THE ARRAYS OF NAMES THE PROGRAM 
WILL ASK YOU TO ENTER THE NAMES ONE BY ONE. 


FOR A DEMO USE THE FOLLOWING ARRAYS : 


DATA = EE > CARS 
ROW NAMES ---> CARSR 
COL NAMES ---> CARSC 


GLOWESS....... THIS GROUP CONTAIN ALL OF THE PROGRAMS REQUIRED TO 
USE THE ROBUST LOCALLY WEIGHTED REGRESSION SCATTER 
PLOT SMOOTHING TECHNIQUE DESCRIBED IN ‘GRAPHICAL 
METHODS FOR DATA, ANALYSIS' BY CHAMBERS (PAGE 121). 
DETAILED EXPLANATION OF THESE PROGRAMS IS 
PRESENTED IN ‘LOCALLY WEIGHTED REGRESSION AND 
SCATTER PLOT SMOOTHING; A GRAPHICAL EXPLORATORY 
DATA ANALYSIS TECHNIQUE' AN NPS THESIS BY 
CDR GARY W MORAN, USN. 
THESE PROGRAMS ARE COMPLETELY INTERACTIVE AND 
CAN BE IMPLEMENTED BY TYPING : 


LOWESS 


AND THEN ANSWER THE QUESTIONS. 
YOU WOULD NEED THE FOLLOWING TWO DIMENSIONAL 
ARRAYS : 
- ARRAY OF DATA ( IN APL INSIDE THE 
WS OR OUTSIDE AS A FORTRAN FILE) 
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~ ARRAY OF NAMES OF COLUMNS (AN ARRAY OF 
DIMENSION [NCOL,20]) 
- ARRAY OF NAMES OF ROWS (AN ARRAY OF 
` DIMENSION [NRON,20]) 


IF YOU DONT HAVE THE ARRAYS OF NAMES THE PROGRAM 
WILL ASK YOU TO ENTER THE NAMES ONE BY ONE. 


FOR A DEMO USE THE FOLLOWING ARRAYS : 
ПАТА -------- > CARS 
ROM NAMES ---» CARSR 
COL NAMES ---> CARSC 
-PAPA 
&GOTO -ONE 
-SEVEN EXEC APLGST 
&EXIT 100 
-ЕККОКТ &ТУРЕ YOUR VALUE HAS TO BE BETWEEN 1 AND 6 TRY AGAIN 


&GOTO -ОМЕ 
“FINAL &EXIT 100 


2. APLGRAFS VSAPLWS 

The following is a description of the content of the APL workspace APLGRAFS 
VSAPLWS, which contains all the functions needed to use the programs described in 
this thesis. This workspace contains several groups, each groups is related to an 
especific program, and contains the functions required to execute that program. 


Following is a list of groups and functions inside those groups. 


Group GBOXPLOTAB 


Functions BOXPLOTAB ADMI BOXLINES 

Group GDRAFTSMAN 

Functions DRAFTSMAN DRASYM DRAFT 
REPEATCK MINMAX TRANSFORM 
JJITTER SUB ADMINS 
LABELS REGRES REGRES2 
LONS YMAVS MOVS 
MMOVAV GARY GARY2 

Group GLOWESS 

Functions REPEATCK LOWESS REGRES2 
LONS REGRES PLOTQUERY 
DATAINPUT 

Group GSCATPLOT 

Functions MINMAX TRANSFORM JJITTER 
SCATPLOT ADMI 

Group GSTARPLOT 

Functions TRANSFORM STARPLOT ADMI 
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3. APL PROGRAMS. 

This section contains the program listings of the APL programs written for this 
thesis, and the modified version of some existing APL programs taken from Johnson 
(Ref. 4] and from Moran [Ref. 5]. 

a. BOXPLOTTED tables (Program BOXPLOTAB) 


[OJ BOXPLOTAB; DATAO;DATA;IPL;NROW;NNCOL3;NNNCOL;NCOL;PLO; 
UIND;YL;S20;DIF;LHEAD;YN;UIND1;UIND?2:; К ORDEN ; SORT; 
ORD1:XL3;BAS1:COL20;XX3MEA3VAR3:MED : DATA2 3SCRE3TX3LX3 
Е YY;COA;DATAO1;0RD1;BASO;CONT; Батаа: 


DMI 
ATAO<DATA 


PL-0 
PRCDz'Y')/O 
HOW lt DATA 

NROI/50)/L01 

THE MAXIMUM NUMBER OF ROWS ALLOWED IS 50 , TRY ACAIN! 
01:NNNCOL-NNCOL- 14 DATAO 
L02:PLO<UIND<+XL<YL<S20<20 DIF<' ! 

LEDTER d SCREEN LABEL! 

'DO YOU HAVE A (CoL 20 CHARS) MATRIX WITH THE NAMES 
OF ши. ҮІІ? 


СУШЕ ПУ ВОЗ 
ENTER THE NAME OF THE MATRIX! 


IND1-U 
INDie, (Ссора а Па 


Т 
03:7-<0 

IND1<UIND2«<!! 

Он * 7 7 + 1 

ENTER THE LABEL FOR COLUMN NUMBER ',5(Г) 
IND1<UIND1,( 20+(S20,0)) 

(T<WNCOL )/L04 


ПЕ 
DO 228 HAVE A (NROW 15 CHARS ) MATRIX WITH THE NAMES 


ds 
Fi 


+0 
Vive ty? )/LO 
b THE VANE OF THE MATRIX! 


1. [] 
INDZ<UIND2 PT5] 
1055 


у 


жаа C5 
= 


MBER ел) 


Cry 
= 
Е 
“з 
М 
Se. 
Cry 
ч 
Та 
oo 
pa Ur 
сл Еч 
rj 
© 
“з 
“з 
© 
= 
—< 
C 


OOF EFF F FEF FEE WWWWWWWWWWHYDNY hOhOhOhOhOPOh2hOFSEAEAEAEALBR RPRPRPOONMOFEWONFR 
A 
N 
ч 
© 
C 
оз 


мы ошооо У Ол {= о мн ОО оо УО л 4 мн ОО СО Сум Оэ Uu (ON HP. OmU Oo -JO0)x)0 ££. шмон олли лил 
ол лп ли пп пи пп и илии ии лил л изеу IL JL IL IL IL JL JL JL IL JL и н ду ое 


гии ии ии 1 гии им 0M Far a a6 Mw aM MW aM 0M WM 


Гар еее е че 3 F t -OG YG -at aa 
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PL<IPL+1 

OL<6 
ог В 
6 :NCOL<NNCOL 
L<IPL+1 


OL<0 
7:DATA«-DATAOLs CCCZPDL-1) 6)* NCOL)] 
DM (((IPL-1)6)+ dE 
NT<,&(NROW,(NCOL)) (( NCOL)+1) 
TA1<,QDATA 
SO<! an50vVCONTNVDATA1NVOVAVBOX;01 .2VNVPLOVLHEADWVXL 


-Y 
I „1.16 .20.92.85VVLIN 1.8 7.2VLINWV1 10V 
UN BASO 
OL20«(NCOL+1 ) 20 
Х< С0120 


<1 
HO-(NCOL-1)0 
RHO1:I<I+1 

Е ZJOR[:I]) 

(TIE>(NROW 2))/LRHO2 

своя 
(МВОИх2)-1)) 


03 
O2:NA«-NROW CCCNROV *1) 2)x2) 
[2-1]-«CCC-Z CZORE:Z-11*2))2-N1)x0.5) CCCEZCOGZORLD 5I] 


))-N1)*0.5) 
[ ue JOR[:I-1] JOR[:I]))-N1) RHOLI-1] 

0 NCOL ) )/ CRRO1 

( 20%520,!РАМК СОВЕ..-!2,(204 әЕНО), 


7204820, !МЕАМ .-!), (20 Ч ФМЕДМ РдТтА)) 
204S20,'VARIANCE .-'!), (20 Ч VARIANCE 


7204520, МЕРІДМ .-!), (20 ч Ф(СОГ20 


4,VAR,MED) 
ZON ! IUINDALC(IPL-A) 120)4 (NCOL 20)] 
8,0.9 


F^1 w^ ees 
cM uS mM 
сне Су 


Оу Bon Uy 
Mm дъ СУ С СО 


да ЗЕ аре Одо 


SR Qty) why 
с ош, шпа 
№ муд Og 

513 
“Суға 


Hj RP e 


Qj 
е: 
у, 
T 
D 
Mj 


mE 

е 
WAY мл RO) 
Se A AAR 
aH 


О 


UIND<' 0 
UIND+(COL2 
SCRE<0 ,0.8 
TX<1 


v ә 
он 


° м. 
E 


La dos vYeyyI-cUINDSSSIZ---S20----- ВЕРИЛИ С! 
BASA-BASA1,!'140-LIN LX TXV1 O OVO 10 OV! 

YY-COL201 

RUN BAS1 

SCRE<0 ,0.05,0.98,0.15 


о 
© 
hy 
ф 
SN 
+ 
> 


I>4u)/MOO01 

IND<(COL20,1) DATA2C(5-I)3] 
reCOL2Z0 T 

a BAS1 


LZ<2 
Gn#<0,0.2,0.98,0.85 


701:1-1-4 
IND-(COL20,1 ола ), (5О ФОЕРЕТ1), 


) 

20 2 DATALI:] 

7<с0120 ((МЕОМ+1)-ГТ) 
N BAS1 


Uca s ge) Sx CL 
>< © 
4 © 
HB к 
кч 
^ 
о 


UN e O O ON O Еаоаме оси ил ли пл вр о реми и ил мл ма га ги + (аи ин Ses үкен а ын н нн н mS i ДНИНИ ИИ 

дъ 

с кч 

SN Lr] 
C 
^ 
т” 
Су 
QO 


ee НН НН НН НН НН НН (О ЦО Ц (0 ЦО Ц 0 0 2 Ц 00 СО СО СО со OO WMD ONN NNN NNNNNOOOMO OQO 00000 слелелелелолсл 
ee мжеоосососсосососоопоо 911 Бомькьосолдаа ош Ne офю NOO FWNRPOWOND м FWHPOWONOMFEW 


геге гел лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг л + 1 marnma г г ггг ггг ггг 


Let Ll) 
Sy UP 


bue, cr 


5] 


И 
YOz!Y' )/M02 
PA 
е 


USE 
O YOU WANT TO JOIN WITH LINES DATA POINTS OF TRE 
SAME POSITION! 
(!Y'z14(0)/MO2 
ВОХ ENTER THE POSITION OF THE DATA POINT (ENTERO 
TO A 
O-DP-U)/HM 
Е pase 41 "OXLINES DATA 


TO PE 

ДЕ 
-Q.e 
ION 
15: 
ма 
ON 
byw Dy 
ÈN 

3 


ШЕ 

/0 

A: : ! DO You WANT TO SEE THE DIFFERENCES BETWEEN 
OLU! MNS Y/N?! 

а ^0)/0 


+0 

К 1 

<(NNCOL,1) NNCOL 

d Дур, ABSOLUTE DIFFERENCES (A) OR RELATIVE 


A'zDIF1-1^0)/TUMA 

HEAD-'ABSOLUTE DIFFERENCES BETWEEN COLUMNS! 

THE DIFFER RELATIVE TO THE FIRST COLUMN (F) OR THE 
PREVIOUS (P)? 

!'P!z1 DEB /TUMAO 

АТАО< | (РАТАОГ; WNCOL] -DATAOUs; (1* NNCOD)1) 


MA? 

UMA01:DATAO1<&(NNCOL ,NROW) DATAOL;1] 
ATAO« | DATAO1-DATA(; (1+ NNCOL )1 

Oa« (NCOL , ‚11 


UMA1:LHEAD<! RELATIVE DIFFERENCES BETWEEN COLUMNS' 
HE DIFFER. RELATIVE TO THE FIRST COLUMN (F) OR THE 
PREVIOUS (P)? 

(!P!z140)/TUMA1 

DATAO- С(ратаост п ааа -DATAOL; (1* NNCOD)1) 
РАТАОГ: NNCOL] 


TUMA11:DATAO1-8(NNCOL,NROW) DATAQU11] 
DATA0« | CCDATA01- DATAOL; (1+ NNCOL)] 135 DATAO1) 100 


Au О 
E 


CO zhg 
Se orit 


> 
= I 


= 


COA-C(NNCOL 

TUMA2: сад ИСО 15)! DIFF. BET. 
UIND1-,A4, S(COÀ)), CCNNCOL, Т) бу О 
$ ( (ММСОГ, ы e COLI) 


гг rar wow ww 1 гг rar www ww M3 гии гг лг лг лг гаги нар гг rar ww 1 
PBR opBHBHHHH pn де че ча че че де че де че че кызынын кы қын 
лл (лл Ес ЕБЕ Его 0бь0ошо WWW WWHNYNY NNNONNNNN PP қыны 
WN к ОсОО) MOE осожювошолофол FWN PODON бШЕЮФЮНО WO NOME 
ел LLJL JL JL ла eit (әсел 3 [ШИЕ | БИШ | БЕШ илл из гл) сага ЕЯ 


UIND2<UIND2CORD; 
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b. STAR plots and PROFILE plots (Program STARPLOT) 


LO] STARPLOT; PRCD;ANS3;SCOL:;SROW;NCOL; NUP; INC;MAX3SINT 
; COST; BASIL: MIN;SPA;TC;iPI;ONE;R;C;POSN;XAXIS;P; 
BASO; BAS1: s XX: TYP 

TYPE (4) Kok STAR PLOT OR (P) FOR PROFILE PLOT ' 


АШ 
(ТЇР&!8'Эл(ТҮРЕ!Р! ))/удО 


PRCDz'Y')/O 
NE<I<0 
СОГ= 1+( DATA) 
RON<1+% nr DATA) 
DO YOU HAVE A (NROW 20 CHARS ) MATRIX WITH THE NAMES OF 
ROWS Y/N?! 
> (1у1т2 (140) ) /.700 
! ENTER THE NAME OF TRE MATRIX OF NAMES! 


mma ara armada 
ee OONOUNFWNEe 
paco рола оо ло ил ла 
S E a 

Y 

> 


ла 
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TO. m TS. m 
© 


= 
о 
ENTER THE N ME ROR ROW NUMBER ' ,¥(I) 
<- 
( 
0 
р 


= 
кез 
BNO +S -= 
ee peris 
So HS 
М х 
"з 
txj 
= 
S 
РУ 
4 
PÚN 
= 
Ky 
F3 
mi 
М 
чә 
ty 
= 
"s 
Ss 
[с] 
Сл 


су-су - + 
ох ЧО Суи 


. ALL/SEL?! 


АМ5%151 
ENTER AS A VECTOR THE SELECTED COLUMNS ' 
DATA«-DATALs:SCO 
K01:'DO YOU RANE ALL THE ROWS OF YOUR MATRIX OR 
ол” ROWS (ALL/SEL)' 


NS<1 

(ANSzZ'S!')/K 

ENTER ASA VECTOR THE SELECTED ROWS ' 
о [SROW-(: ] 
C 
R 


И 
де 

a A O 
Жез 
= 
Cp 
C 

ww С 


< © 
е 


2: TRANSFORM 

OL< 140 ВАТА) 

аусар A) 

ENTER NUMBER OF PLOTS PER SCREEN (340R5)! 


ГТ АҒ ггг ггг геа гии mnan 
МУМУ OQQAN ллллоллолгггГЕГЕГГГГГООООООООО МММ  МЮМЮМХММКММММСҰҮННН ананын 
NQEWMPOWONMO Гомо Оо Еомюнкоооочо ЕЕОМКЮНК ОКОООЧООХЛЕО во ONOMELWHPOOWONOM FWP 
нн ңы ің нн қы (ңын нн IL IL IL нщ ны» анна ін пн нон қан әд IL OL OL IL) 


A 
-> 

! 

D 

K 

N 

N 

CON 

КОРЕ 0 

-CON ((NUP»2)^(NUP«6)) 

N ОМВЕН ОЕ PLOTS HUST BE340R5 , TRY AGAIN' 

> 

CON: INC<0.95 NUP 

ONE-(NCOL,1)1 

MAX«-MIN-NCOLO 

-(TYP-!'S!)/LO 

XX< Са Маб (( а 

Ме; NCOL,1)0),XX,ONE,ONE ian 0) 
Me( (C2 NCOL ЗОРЕ (з) E 1004. о 0) 
LO:SINT<10(02 EE 1)) NCOL 

С057529(92 6 МСО Tu ) NCOL 

M«-ONE , C CNCOL , 192 (0.8 COST) 41) 2)), ((NCOL,1) 
(((0.8 SINT)+1)2)) 

MeCCC2 NCOL), 3) On M uc 3) (1 5)))) 
LLO:BAS<'paal2¥N¥3%. 0 BPSRPROPFSASLINSLINVOFFST: 
RON BAS 

ANGELA 

>(TY P-ip!)/LL 

Me ((NCOL, 1) CCCOST+1) 2)), ((NCOL,1) (CSINT+1) 2)) 
ELO: Me ( (COL DÈC (1INCOL) ((NVCOL)-1)))-0.02), 
(CNC ог, 1)0.5) 

ANG 

104 Теге 

МАМ<НСГТ: 1 

Рози (в (МТ: 11 ,„М.71:21)),! Е 

BAS< | an 2VNANYOYCVANCYEYNOVNOY' , POSN ОЕР! 
MINCIJ<DATACi+ADATAL3I13;2I) 

MAXLI]-DATALC 1)4^ADATAUD;I1;I] 

-(I«NCOL)/LOO 

PAUSE 

5РДе<!! 
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[76] I< 
[77] TC<-NUP 
[78] LOOP: TC<TC+NUP 
ето. 05. (1 (TNC: (INC6))),(0.05+(INC-(INC E DOR 
[81] LOOP2:R<+R+1 
[82] C<0 
[83] LOOP1:C<C+1 
[84] 7<=7+1 
м (-INC),INC,(-INC)) ((C-1),(R-1),(C-1), 
[86] XAXIS-NLI;] 
[87] Pe(DATA[I:]-MIN) (MAX-MIN) 
[88] +(TYP='S' 7.000 
E БО, 15 CONE, XX ,P),EL11((34,3) (15 XENCOLdS 01 ERE 
> /у 
[91] ings d {(NCOL ‚1) (((РСО6Т)+1) 2)), СОСО 
1 
[92] MeM lid CC OWE З), СМ, СОЛЕВ ОС 5) 
2931 У000:8250<!а1259У919.0.0 РРУЕРЫО ОЕЕЗРОВПФ ТМД ТМЗОЕРЯ: 
[94] В451<'вв О .07 ЕРУОМУ! 
[95] RUN BASO 
[96] RUN BAS1 
[97] »(I2NROW)/ENDO 
[98] > MEO DS =; 
[99] > ( (С<МИР )л ( (2C4C)<WROW ) )/LOOP1 
[100] (R«NUP)/LO 
[101] END: PAU 5g" 
[102] (CPC HA ENRON )/LOOP3 
[103] ENDO: PÀUSE 


c. CODED SCATTER plots (Program SCATPLOT) 


ГОР! SCATPLOT; QUE1:;CX;CYI:I:;DATA1:; LHEADS LPLOTs3NCOLs LABX 
YiEXXiEXY;POSN; "Basi: EXPRE;SYM;COLiSYZ;DESCHRI; 
POSLEÓ; A1; X3Y; SPA; X 5S Y 3 PLOTA; PLOTO ; PLOTLEG 


[1] POSI-O 
[2] ADMI 
[3] DATA1<DATA 
[+] —-END MED тут) 
[S5 TONE Diva. 
[6] ' YOUR DATA IS NOT A TWO DIMENSIONAL ARRAY , SCATPLOT BEING 
TERMINATED! 
nu E NS REFORMAT YOUR DATA AND START AGAIN! 
> 

[9] ONE: РАТА«РАГА1 
[10] SPA-! ! 
[11] NCOL< 1% Пади 
[12] (POSIZO)/ONE 
Ici а SCREEN HEADER! 
L15 z UE сащ еи THE PLOT HEADER '! 
[17]?! EU THE VARIABLE (COLUMN) FOR THE X AXIS' 
[18] X-DATALs;sCX-UJ 
122, Ее ШЕ ЕЕ ТНЕ LABEL FOR THE X AXIS '! 
C21] 'DO You PANE ALL THE VALUES OF X OR JUST A SUBSAMPLE 

ae "n TOLMLDAZSUDB)! 
[223 ДЕН «1 ^[!] 
[23] 770 EARD Ea 
[24] 'EN ДЕН AN APL EXPRESSION WITH THE н OF VALUES FOR X! 
n Dd С» (РАТА[:', (СХ), ' 12500 )л (РАТА Е", СОО И ТИЕ 
[27] DATA-EXXZDATAÀ 
[28] THO:'ENTER THE VARIABLE (COLUMN ) FOR THE Y AXIS' 
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FIVE: +SIX (POSI>1) 
ENTER A LAB EL DESCRIPTION) FOR THIS CATEGORY (MAX 25 


С 

DESCBTe254 

DESCRI-*(9$I Да DESCRI,SYM,' ',(8SYZ) 
POSLEC<0.8,(0.75- (15) 100 


ELOTEEC* PT. ЗЕВСА: ,COL,'VOVLNONVS3VYESVNONPOSLEG RS 
RUN PLOTLE 
aun ' DO 0 100 WANT ANOTHER CATEGORY (YES/NO)! 
= 
2005 ( (00Е1-1У!) 


РО51>1)/90Е01 
AUSE 


<141 
UE1-'Y')/ONE 
USE 


[298] Y-«DATAU;CY-U] 

ЕЗ о, ШІ Tg tT ABEL FOR THE Y AXIS! 

[32] 'DO YOU WANT DON THE VALUES OF Y OR JUST A SUBSAMPLE 
OF IT (ALL/SUB)' 

[33] AU <14 

(341 2701 (QUE1= 

hoo ENTER AN APL EXPRESSION WITH THE RANGE OF VALUES FOR Y! 

Poets Ce COATAL: ', CeCcY),'12500)A(DATAL;:', (oCY),'151000)' 

[371] ЕХУ<0 

[38] DATA<EXY/DATA 

[39] TWO01:Y<DATAL:CY] 

po] XeDATAU:CX] 

C41] JJITTER 

C42] TRANSFORM 

[43] MINMAX 

[44] 7<=0 

(451 О ОО ГО RE BLOT BIG 220822 5 

[46] POSI-POSN-U 

[47] LOOP1 (POSI>1) 

[48] POSN-0.10.10.80.,.8 

ШЕП 700Р1:7<Г31 

204 130 (Т) 

[52] 'ЕМТЕА IN AN d A rM ei THIS CATEGORJY! 

НЕЗ | 17, (DATA 5 )^ (РА SN 5I 

bool 'USE DATA AS one NAME OF FOUR VECTOR ! 

[55] EXPRE<[ 

[56] 'ENTER THE SYMBOL ! 

[57] 5УМ<=П 

ШЕРІ VENTER THE COLOR , I.E. BLUE! 

[59] COL-« 

[60] ЖЕ ЗЕ SIZE , AS A NUMBER BETWEEN 1 (SMALL) AND 

[61] SYZ< 

fee] SYMBOLS<SYM,':!',CcoOL, 25437 

L63] FOUR (Iz1 

[64] PLOT1I<'anlOVxvVY¥VoEXPREY' ,SYMBOLS , '¥VSPAVSPAVSPAVSPA! 

[65] PLOT1<PLOT1, ' SPOSNNVPXPVA O0 OO 1 0 OV! 

[66] RUN PLOT1 

[67] FIVE 

EE Lu US HEU uoTmPObSS VEPLOPVGHEADN 

[69] ааш '" VIABYVPOSNNNLIN LX TXVLIN LY TYV111 

70] RUN PLOTO 

71] 

72] 

73] 

74] 

481 

76] 

7 1 

781 

741 

801 

811 

821 

831 

881 

85] 

86] 

87] 

881 


Meee eee eee Oe eet rw 


E 

END 

OU: 1:'DO YOU WANT ANOTHER PLOT (YES/NO)' 
РА 

Е 
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d. CODED DRAFTSMAN plots (Program ОКА 
DRAFTSMAN; NCOLIPI; RC, d ТУ%Т72У;ХАХІ5; УДХТБУЛ ТЕ 
TY;ANS;F3;ROB3Y13X13:YS3;M3;NUM;PRCD3DIM;YM3;XM;UM;SKP 


MINS 
A<+SYMBOLS<LPLOT+<LHEAD<XAXIS<YAXIS<'! ! 
MBOLS<' 0! 


= ТА 
LP1 (PRCD="'Y') 
1:>LP2 (DIMT3). 
ay beso ОЛЫ 
22, four: DATA Sb? IS NOT A TWO OR THREE DIMENSIONAL 
DRAFTSMAN IS BEING TERMINATED. PLEASES REFORMAT YOUR 
REINITIATE DRAFTSMAN! 


P4: N DRAFT DATA 


Oy ed + GIN ы 
SUS 


= No 


Qty 
CO 


Оре 1 ( Рата) 
ITTER 
RANSFORM 


nee НАРТ A SYMBOLIC DRAFTSMAN (YES/NO)! 
ON1 с ) 
A 


D 

COL-DRASYM DATA 

НЕАр< 

е 1 

YOU HAVE NOW ',(®NCOL),' BASIC VARIABLES TO PLOT! 
041: 'ENTER MEE OF PLOTS PER SCREEN (340R5)'! 


P 
ON ((NUP>2 )ACNUP§<6 ) ) 
Е OF PLOTS MUST BE 34 0OR5 , TRY ACAIN: 


ON:TR<+-NUP 
NC<0.95 NUP 

Шан ода ET R+NUP 
TC< 
Г, 
V 


MOG) „сусу Саар -QNAL чо ЧО 
о СЄ "rj = з 


БОРА  TC-TCAN 
Tg gus МС (INC 6))),(€0.05+(INC-(INC 6))),1 


“ч 


R<0 
200Р2: : R-R1 
YeDATAL T6 ; (TR+R)] 
X 

( 

P 


0 
ДЕ Б) 
TC+C ))/SKIP 
о (INC, (-INC),INC,(€-INC)) ((C-1),(R-1),(C-1), 


НН ;1 
ТЁ+Ё 
R= NOP Sv ( (LR+R)= NCOL)))/GRAPH 


T 
1 )/GRAPH 
ois (Сы: | 
<- l 
R=NUP )v ( (TR+R )= NCOL ))/GRAPH 
XIS- <ҮДАТ5е! 


ДИ ут o 
SMT='M' )/MOV 
LOWS Y 


WS 
MOOTH<'Quvx¥Y;YSV¥0 141%. ай 
MOOTH<SMOOTH , ‘¥LIN LX TXYLIN LY TY¥1 1 1¥10 11 0 


Б О О оо У Ссл 4 ом Сс) О сом MAHFLWNHPOKOONMMNFEWHPOKWOONMUNIFWHRPOWOONMONFOANDRP O ылда LI 


гаас mmm ammo листите mamma MMMM TOTO Mees (0 
холашачшасшдоша в ЕЕ FEE EEELEWWWWWWWWWWNHNMNYNYNYNYNNYNNRPRPPRPRPRPRPRPRP RP OONMAMFEWNHNPrP O 
ли Ee ла аа | 


X 
n 
( 
X 
( 
X 
Y 
( 
X 
( 
Х 
5 
5 
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[62] RUN SMOOTH 

БӨЗ! ӨКІ 

[6x4] NOV: Pu MOVS Y L AX] 

[65] УМ< 

[66] М NOVS XCAX] 

[67] XM«-UM 

[68] SNOOTH1<'auvx: XMVY: YM¥O 191%. PSPAVSPAPXAXISVYAXISVPOSNN 
[69] SHOOTHI< SMOOTH , 'LIN LX TXNLIN LY TY¥1 11910 11 
[70] RUN SMOOTH | 

[71] SKIP 

72] FIN: BASe'nn10VXVYV', ,EXP,'S' ,SYMBOLS ,' VLPLOTNLHEADX 
[73] BAS«PAS, ! oU AXISVPOSNVSLIN LX TX¥LIN LY TY¥111 
C74] RUN BAS 

[75] SKIP:>(((TR+R)2(NCOL) а О ИВ 
(761 а LOOP1 

[77] (CR<NUP )ACCTR+R )<NCOL ) )/LOOP2 

[78] END:>(ANS='Y')/SKIP1 | 

[79] WI GARY2 INC 

[80] SKIP1:PAUSE 

[81] ERASE 

[82] и 

[83] CCZR-R)«NCOL)/LOOPM 


e. Suporting Sub-programs 


FUNTION ADMI 
DMIsSQRA;QR2 


A 

A 

а FUNCTION ADMI CALLED BY FUNCTION omo USES 

а FUNCTION CHSREAD , THIS FUNCTION IS A MODIFIED 

a VERSION OF THE FUNCTION ADMINS FROM DTWLENS VSAPLWS. 
А 

Р 


щеше y! 

ЩЕ оон DATA SET LOCATED IN THIS WORKSPACE? (YES/NO)' 
= 

о (QR1z'Y!) 

P1:'7S YOUR DATA SET LOCATED:' 

(1) IN AN APL DET LOCATED ON THIS DISK ORON A DISK' 
THAT YOU ARE LINDED TO' 

EE T FILE ON THIS DISK ORON A DISK THAT YOU АКЕ! 

Cae) M E DEO TES QR (2) ABOVE' 

M TER (1 op 


<0 
Р2 ,LP3 n R2] 
170 TRANSFE oun DATA TO THIS WORKSPACE: 
457 ЕЁ... )PCOPY (WS NAME) (DATA SET AME! 
ТУВ: )PCOPY DTNLDATA CARS! 


DATE AND TIME SAVED INFORMATION IS DISPLAYED! 
WHEN THE TRANSFER IS COMPLETE. THEN ENTER CO! 
TO PROCED WITH SCATPLOT'! 

AADMI<GO 

0: ene THE NAME OF YOUR DATA SET ! 


TA< 
TM<- DATA 


D 

P3:'TO TRANSFER YOUR CMS DATA FILE TO THIS WORKSPACE ' 
ANSWER THE FOLLOWING QUESTIONS ABOUT YOUR DATA SET'! 

T 


-NAD ~ ~~ = = HQ 
< = 


TA<CMSREAD 
us DATA 


Оол Е сом Оо ооо У Ол = ом Оо шоо УО л 4 әм озн илии) 


гг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг лг Г 
CO CO CO CO CO COCO PO IO PO PO PO PO PO PO PO DIS: HS HS IS ES ES IS ES IS IS CO O0. - OD O1 - COND I CO 
клипа я ляп пла c. 


WY -NUUNUU ~ = = = = 
= v= ma 
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[37] LPu : ' YOUR DATA SET MUST ВЕ STORED IN AN APL WORKSPACE OR ' 
[38] 'INA CMS FILE LOCATED ON THIS DISK ORONADISK TOWHICH ' 
[39] 'you ARE LIN SCATPLOT IS BEING TERMINATED. PLEASE ! 

[u0] 'COMPLY with CONDITIONS (1) OR (2) AND REINITIATE SCATPLOT' 
[ud] PRCD<'N' 

[42] END: 

FUNTION BOXLINES 

[0] DAT-JQR BOXLINES DATA 

[1] NCOL- ATA 

t23 NEOUS1L DA 

L3] Mx 8 (1r (NCOL-1)9 

(41 МХе<1,МХ<,8(2, МХ) (МХ,МХ) 

[57 Je (l l^ MX).,1 

ES Wes ((1. (NROW-1)) JOR<JOR[MX]- 

[7] МХА ¢ trad 14 ИХ)),1) ((МХ (214) ) nx) ((21 140), 

C8] JM<(( MX) JM),MX,( (MX) JOR) 

[9] BAS2<! ш е, 0 RPYRPBOND 0.20.98 .85 VLINVLINVOFF¥'! 
[10] RUN BAS2 

[11] END:DAT<0 

FUNTION DRASYM 


LO] NCOLeCDRASYM МАТЕТХ;СТ:СУ:Т;5ҮЙҮСОБРБ; 55 
Е о THE JizlABLES (согу: THAT YOU 


C2] 'IN THE X AND Y AXIS (THE FIRST AND SECOND DIMENSION 
«0 ТНЕ РЕОТ)! 


C 
N-NLCCI):] 

DATASMATEIXU: СГ] 

NCOL< CLI 

I-Q 

EXP-SYM-COL-SYZ-!! 

! NEXT , YOU ор TO ENTER APL EXPRESSION FOR EACH 
CATEGORY (CODE)! 

| USE AS 520) NAME OF YOUR ARRAY! 


| I.E. (ХХГ: Г1>100ОАСХХГ УД шоо) 

n тар J REPRESENT COLUMN NUMBERS BETWEEN 1 

'B BE CAREFULLY NOT TO OVERLAP VALUES ' 

WHEN THE PROCRAM ASK FOR SYMBOLS TYPE ANY (ONE) 
CHARACTER! 

'FOR COLORS TYPE THE NAME OF THE COLOR I . E. BLUE OR RED! 

"WITH SIZES 1 REPRESENT SMALL AND 12 BIG! 


ГОЦЕ Е аа 
' ENTER THE APL EXPRESSION FOR THE CATEGORY (CODE) 
NUMBER ! ‚ ($ I) 


E 
EXP-EXP 
ENTER "Tug SYNBOL 
SYM<S 0 
а, THE COLOR! 
ENTER Tug size 
SYZ-SY 
120 You Warsi ANOTHER CATEGORY (YES/NO )' 


LOOPA (ANS-'Y!) 
EXP-2VEXP 


шм ошоо молк шош юмоосо NOON Ешмно зал 


mmm гэг tomato ama oar 
соо оо козы кы ыы соч Осло 
лл LL Ll LIL LLL 
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C34] SYMBOLS<(1¥SYM),'3:',(€24¥COL),'3',(2+¥SYZ) 


FUNTION DRAFT 


pu D RTMIDATANCODSTHRSTCSPISKRSCSYSTN:ITO2NSXAXISS;YAXIS; 
n ee ALilosMINUMSNEAGSVARTNORETXU: 

[1] aaa DO NOT MOVE OR ERASE; GRAFSTAT FUNCTION HEADER 

[2] ana CRAFSTAT WILL NOT ADD ALINE TO THIS FUNCTION 
WITHOUT THIS HEADE 

[3] 'ZHE THREE DIMENSIONAL DRAFTSMAN DISPLAY IS BUILT 
ONE VARIABLE AT 

СЧ] о PROGRAM WILL ASK YOU WHICH VARIABLE YOU 

mo) TEACA TIME Ж, 75 Ды FOR ANEW ONE. THE DISPLAY 
PRESENTED FOR EACH! 

[6] Nou cae REPRESENTS THAT VARIABLE PLOTTED AGAINST 


[7] 'VARIABLES PA СЕ BY PAGE. THAT IS, THE FIRST ROW 
REPRESENTS THE FIRST! 

1 'PAGE OF DATA , THE SECOND ROW REPRESENTS THE SECOND 
PAGE AND SO ON' 


SPA<'! 
Я 


DATA 
QOP5:'WHAT VARIABLE DO YOU WANT TO LOOK AT?' 
к со ЭЛЕСИ (в ONCOL Р) ру) р 2} м 


€XUt*0,1 XU-[ /L/DATALs s CVAR )1 
NE e ARDS, s\),' WELL BE PLOTED 45 THE INDEPENDENT (X 


m 
AND ALL OTHERS WILL BE PLOTTED AS DEPENDENT (Y 
VARIABLES). 
JJITTER 
TRANSFORM 
GARY 
CON: SENTER z OF PLOTS PER SCREEN (3,40R 5)! 
ейї 
((NUP<3 )v (NUP>5))\CON 
NC-0.95 МИР 
be 05» (1-CINC+ (INC 6))),(0.05+(INC- (INC 6))),1 
ORS FERTR+NUP 
NUP 
OP3 i TC+IC+NUP 


6 

<0 

OOP2:R+R+1 
C<0 
zu 
4 
U 


з 
ы 
КЗ 
ны 
1 
S 


rar war ww wow r3 
со 


rt 
lee. Le eee WwW w WW) GW) CG) CG) GD GD AS AD AD AS AD AD AOD AD AD AO hea + ^к |J HI IL FI LO 
елау У О eae ey 
КЕ 


3 


T 
Р 


n 

L 

L 

X и :VAR] 

LOOP1:C<C 

у DATACCPRIB)::CTCA co] 

YU<YU+0.1 YU< С ДАТА засне: 

COM BS (PC-C))/SKIT 

POSN-PI-*((INC,( TNC), ІМС, С INC)) (€(C-1),(R-1),(C-1), 
XAXIS<NCU(7C+C) 3] 
y HERD 1 
5 R=NUP)v ((TR+R)=NPAG) ))/GRAPH 
( 
X 
Y 
( 
X 
C 
( 


95р РН 
XI (7С+С) 31 
15< 


R= HOP )y ((TR+R)= NPAG) )/GRAPH 
Х15<У4Х19< 


ооо-ооггок оюощлбпеовомньопо лос вомьоо со -JO01 [COND FS COL 


гг лг ae ae eee л rwowouaauwuwuiuuwia wa ww Ww 16 FFF 17-4 
шылу к лл ging rr gp gg qs 
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а )/MOV 

и ;YSVO 1¥1¥ .¥SPAVSPAVXAXIS¥YAXIS¥POSNY¥ 
о Ве "LIN БУ 701 1 1101 а 

р SMOOT 


MOV: Ви MMOVAV YUAX] 
YM-YMAV 
МИМО VA AY XCAX] 
«Y 
SHOOTH 13e 'nu9X; XMNY;YMNO 1919.9SPAVSPAVXAXISWVYAXISWPOSN 
VLINLX XUW! 


Uc aa "22127 701 1 1710 100008 


iE ori 'QuVXVYVOVIV.VSPAVSPANXAXISVYAXISVPOSNN 
HUE 'LIN LY LUST 13910 10508000 


BA 
RUN BASIC3 
SKIP: PUCUTRAR)2 (NPAC)) атадан де 
ЗА Е ИАС LOOP1 

R<NUP TR+R)<NPAG))/LOOP2 
END:+(ANS='Y!)/SKIP1 
GARY2 
SKIP1:PAUSE 
ERASE 
ee E E an 
гр 
MOR 
Lo 


TRt*R)«NPAG )/LOOP 
Q SAU ones TO LOOK AT ANOTHER VARIABLE?! 


Ops t TORE- LY 


гаси ии гит Mamana maam 
зала а ачодоо OOOO OOM WWMM 
ооо УО сл Б омон ооо Мо ПЕ POWONMONF WNE 
LLJL JL JL JL JL JL JL JL IL IL IL JL JC езт Ел с п и LSL 


FUNTION CRAPHER 
Го) КУНІН Шы s, CGR23 CR33 ANS33 25$ 215X15; A452 ANS SERRE 


[1] aaa DO NOT MOVE OR ERASE: GRAFSTAT FUNCTION HEADER 

] ana GRAFSTAT WILL NOT ADD À LINE TO THIS FUNCTION 
WITHOUT THIS HEADER 

ADMINS 

NN<N 

BP CPRED= "x4 > 


PGP 2 (DIM>3) 

DDR LP3,LPu)ULDIM] 

2: OUR: DATA SET IS NOT A TWO OR THREE DIMENSIONAL 

од IS BEING TERMINATED. PLEASE REFORMAT YOUR DATA 
REINITIATE GRAPHER ' 


P4U:NNN GRAPHER3 M 


э 
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а: 
OL- 1^( DATA) 
ER 


р 
TI 

RANSFORM 

К: па YOU WANT TO CONTINUE AND PLOT? (ENTER Y OR N )! 
W 


C 

Gh1z'¥')/0 

dem MATRIX POSITION ARE YOU REPRODUCING?! 
CENE POSITION ON THE SCREEEN? ! 

ARY3 
PÀe!! 


моол Е ом ооо моол ромеи о зл 


ммммммммю нн арр e осо МО слЕс N 
ил л JL JL JL л л и JL д л ощ д JL DL — LJ 


[oM M M M MC 0C M AC MC M M M MC M M M 13 гр сини га 


UAA LA LAQ WNS ZNO 
x hy 
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І (ЛИӘЗеТУҮ! 3/21 
еа УУ; :YSVO 1V1V.VSPAVSPANVNNNECGR2[2] ) 52V 


11); 
RERS*GRFRS ,'¥GR3VLINV1 1 1410 1100! 
RFRS 


ПЕТЕ! очӯрАТАГ; (СЕ2(2121%ГАТАГ; (СЕ2(112)1%.%91909 
GRPRCRER 'YNNNOE(GR202])3;] ¥NNNO(GR2[1]):1]9¥GR34 


L 
Ото‘ 
35] L1: RUN CRFR 
S61 TR 


EAS 


FUNTION GRAPHER3 


J NNN GRAPHER3 M 

aaa DO NOT MOVE OR ERASE; GRAFSTAT FUNCTION HEADER 
aan GRAFSTAT WILL NOT ADD A LINE TO THIS FUNCTION 
WITHOUT THIS HEADER 


ТА=М 
COL< 1+( DATA) 
LTT ER 


ANSFOR 
: ene YOU WANT TO CONTINUE AND PLOT? (ENTER Y OR N )' 


Ri 
CRIL'Y')/0 
AT MATRIX POSITION ARE YOU REPRODUCING?! 


=: 


Е 


MITS 
Да POSITION ON THE SCREEEN? ' 


NAD. =O бурар 
Ку кэ Б By 
НРУ 
ААА 
-C0 


ANSS3z! О 

Е: ЖАРУ”; Х5%0 1919,95РЛ%5РАУММУГ(СЕ2(212:1 
CRERS*ORFRS, В ТМ ТМ ОТО: 

SON CRFRS 


Ж. GRFR<' au¥DATAL; (GR2[2] )]J¥DATA[; (GR2[1] )]¥.71 
FoVSPANSPAP! 

GRFR<GRFR ee) JVNNNUCCR2L1] ); JVCESNLIN 
PLING. A 1910 

RUN GRFR 

RR 


ma m п Mannan maam 
мм MO юкюме PPRPPRPRPRPRPRPOONOMFEW NBEO 
лас C) мрою ONOMFONMRPOWLILINIIILINI LILI 


(112 (лл! (лл лл лд) 


FUNTION PLOTQUERY 
LOTQUERY 


PA«!! 

Q YOU WANT A PLOT OF YOUR LOWESS SMOOTHED CURVE? ' 
YESORNO)..... ENTER NO IF NOT USING GRAFSTAT' 
== 


D (PTz'Y!) 
ПРИ? x AXIS LABEL! 
INPUT Y AXIS LABEL! 
AXIS<O 
PLA (ROBt'Y') 
PHDR<+'ROBUST LOWESS SMOOTHING: F = 
RPLT+' au¥Xi¥Y1;YS¥0 1¥1¥. e+ SAceAVSSPAPPHDRPXAXIS¥ 


У4Х75%21%! 
RPLTCRPLT, INNGI NYI 1170100! 


D 
( 
EN 
1 


ч маллы yhy o ауу у 
D 
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[15] RUN RPL? 
[16] VIEW 
[17] PE? 
[18] PL1:PHDR<'NON-ROBUST LOWESS SMOOTHING: 
[191 МЕРІ BUSXIWYL; > YS¥0 1¥1¥.*«+ VA0® a9 SbADPHBRYXAXISY 
[20] NRPLT-NEPLT,' LINWLINW1 11890100! 
[211 RUN NRPLT 
[22] VIEW 
[23] PL2:'DO YOU WANT A PLOT OF | RESIDUALS | VS X?' 
C24] ' (YES OR NO)! 
[25] 255544 АП 
[26] END (@S5z'yY! 
[27] 'DO YOU WANT THIS PLOT SMOOTHED? ' 
[28] ! (YES OR NO)! 
[29] QS6-1 ^[) 
[30] ВЕТ | | RESIDUALS |" 
C31] PL3 (ОСБ) 
(321 X LOWS(|R 
[33] SRESPLT-'an1VXV(| RESY ) : YSWVO 
1919. «+ VAO@AVES SPASSPARXAXISVXRESIDT! 
[3u] SRESPLT-SRESPLT , 22WLINNLINWA1 11901009! 
(35) RUN SRESPLT 
[36] PAUSE 
[37] END 
рази PL3 Е BESPLT- 'aa19XVCI RESY )90919. x^ VAoOeAUSSPASSPAVXAXISW 
[357 BESPLPCRESPLT, '299LINVLINVA 11901009! 
[401 RUN RESPLT 
C41] PAUSE 
C42] END: 
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APPENDIX B 
SAMPLE PROGRAM EXECUTION 


1.  BOXPLOTTED TABLES 
This program, as mentioned in Chapter II is executed by typing BOXPLOTAB, 
and answering the queries as follows : 
BOXPLOTAB | 
28 YOUR DATA SET LOCATED IN THIS WORKSPACE? (YES/NO) 
ENTER THE NAME OF YOUR DATA SET 
| STOCK 


ENTER THE SCREEN LABEL 
ACTIVE STOCKS FOR THE WEEK ENDED AUG. 8, 1986 


DO YOU HAVE A (NCOLx20 CHARS ) MATRIX WITH THE NAMES OF 
COLUMNS Y/N? 
YES 
ENTER THE NAME OF THE MATRIX 
STOCKC 
DO YOU HAVE A (NROWx15 CHARS ) MATRIX WITH THE NAMES OF 
ROWS Y/N? 
YES 
ENTER THE NAME OF THE MATRIX 
| STOCKR 
роу YOU WANT THE DATA ORDERED BY THE FIRST COLUMN ? Y/N 


> POINT THE BOXPLOTTED TABLES ARE DISPLAYED ON THE 


ENTER 9 TO QUIT 

ENTER É TO ERASE AND CONTINUE 

ENTER C TO COPY AND CONTINUE 

ENTER CE TO COPY, ERASE AND CONTINUE 
„PRESS ENTER ONLY TO CONTINUE 


Eon ТО JOIN WITH LINES DATA POINTS OF THE SAME 

YES 

MEE THE POSITION OF THE DATA POINT (ENTER O TO FINISH) 
1 

Л THE POSITION OF THE DATA POINT (ENTER 0 TO FINISH) 
0 


63 


ENTER Q TO QUIT 

ENTER É TO ERASE AND CONTINUE 

ENTER C TO COPY AND CÓNTINUE 

ENTER CE TO COPY, ERASE AND CONTINUE 
PRESS ENTER ONLY TO CONTINUE 


2. STARE PLOTS 
This program, as mentioned in Chapter III is executed by typing ЗЭТАКРФРИ 
and answering the queries as follows : 
STARPLOT 
TYPE (S) FOR STAR PLOT OR (P) FOR PROFILE PLOT 
S 
ШЕ aU DATA SET LOCATED IN THIS WORKSPACE? (YES/NO) 


ENTER THE NAME OF YOUR DATA SET 
` AUTOS 
DO YOU HAVE A (NROWX20 CHARS ) ARRAY WITH NAMES OF ROWS Y/N? 


ENTER THE NAME OF THE MATRIX OF NAMES 

2 AUTOSR 

DO YOU HAVE A (NCOLx20 CHARS) MATRIX WITH THE NAMES OF 
COLUMNS Y/N? 

ENTER THE MATRIX WITH THE NAMES 
D AUTOSC 


DO YOU WANT ALL COLUMNS OF YOUR MATRIX OR SELECTED COL. 
ALL/SEL? 


SEL 


ee AS A VECTOR THE SELECTED COLUMNS 
| 112 
о ALL THE ROWS OF YOUR MATRIX OR SELECTED ROWS 


ANY VARIABLES DO YOU WANT TO HAVE TRANSFORMED ? 
O IF YOU WANT NONE 


0 
ENTER NUMBER OF PLOTS PER SCREEN. (34 OR 5) 
i 5 
(AT THIS POINT THE STAR PLOT IS SHOWN ON THE SCREEN ) 


WYS DON 
С 


04 


ENTER 9 TO QUIT 
ENTER É TO ÉRASE AND CONTINUE 

ENTER C TO COPY AND CÓNTINUE 

ENTER CE TO COPY, ERASE AND CONTINUE 
PRESS ENTER ONLY TO CONTINUE 


CE 


3. PROFILE PLOTS 
This program, as mentioned in Chapter III 1s executed by typing STARPLOT, 
and answering the queries as follows : 
STARPLOT 
TYPE (S) FOR STAR PLOT OR (P) FOR PROFILE PLOT 
E 
ео DATA SET LOCATED IN THIS WORKSPACE? (YES/NO) 


ENTER THE NAME OF YOUR DATA SET 
i AUTOS 
DO 100 HAVE A (NROWx20 CHARS ) ARRAY WITH NAMES OF ROWS Y/N? 


ENTER THE NAME OF THE MATRIX OF NAMES 

AUTOSR 
DO YOU HAVE A (NCOLX20 CHARS ) MATRIX WITH THE NAMES OF 
NS Y/N? 
ENTER THE MATRIX WITH THE NAMES 

AUTOSC 
DO YOU WANT ALL COLUMNS OF YOUR MATRIX OR SELECTED COL. 
ALL/SEL? 


ENTER AS A VECTOR THE SELECTED COLUMNS 
112 
DO YOU WANT ALL THE ROWS OF YOUR MATRIX OR SELECTED ROWS 
ШИОРИ СЕТ) 
ALL 
HOW MANY VARIABLES DO YOU WANT TO HAVE TRANSFORMED ? 
TYPE O IF YOU WANT NONE 
у 0 
ENTER NUMBER OF PLOTS PER SCREEN (340R5) 
5 
(AT THIS POINT THE STAR PLOT IS SHOWN ON THE SCREEN) 


ENTER Q TO QUIT 
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ENTER E TO ERASE AND CONTINUE 

ENTER C TO COPY AND CONTINUE 

ENTER CE TO COPY, ERASE AND CONTINUE 
PRESS ENTER ONLY TO CONTINUE 


СЕ 


4. CODED SCATTER PLOTS 
This program, as mentioned in Chapter III is executed by typing SCATPLOT, 


and answering the queries as follows : 


SCATPLOT 
15 pou DATA SET LOCATED IN THIS WORKSPACE? (YES/NO) 


ENTER THE NAME OF YOUR DATA SET 
| AUTOS 
FROM NOW ON YOUR DATA SET WILL BE CALLED DATA (IN THIS PROGRAM) 


ENTER THE SCREEN HEADER 
AUTOMOBILE DATA; PRICE VSM.P.G. CITY 


ENTER THE PLOT HEADER 
USA =A, FOREIGN = F AND WEIGHT = SIZE OF LETTER 


ENTER THE COLUMN NUMBER FOR THE VARIABLE ON THE X-AXIS 
Р 1 

ENTER THE LABEL FOR THE X AXIS 

PRICE 


A WANT ALL THE VALUES OF X OR JUST A SUBSAMPLE OF IT (ALL/SUB) 


ENTER THE COLUMN NUMBER FOR THE VARIABLE ON THE Y-AXIS 
| 2 


ENTER THE LABEL FOR THE Y AXIS 
M E-G. CILY 


DO YOU WANT ALL THE VALUES OF Y OR JUST A SUBSAMPLE OF IT (ALL/SUB) 
HOW MANY VARIABLES DO YOU DESIRE JITTERED? 
ТҮРЕ 0 IF YOU WANT NONE 

0 


HOW MANY VARIABLES DO YOU WANT TO HAVE TRANSFORMED ? 
nee O IF YOU WANT NONE 


0 
eee THE POSITION FOR THE PLOT E 6G wie eee ee 


EA 
E 
> 


1 I 11 TOT 
ENTER IN AN APL EXPRESSION FOR THIS CATEGORY 
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TA 


(DATAC;13]=1)A(DATAL;8]< 
ak THE SYMBOL (ANY LETTER ,NUMBER OR SPECIAL CHARACTER) 


№ 
ens 
oy 
осу 
why 
O 
ry 


ENTER THE COLOR (WHITE ,GREEN , BLUE , TURQUOISE ,RED,YELLOW OR PINK 
ENTER THE SIZE , ASA NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 
ENTER A LABEL (DESCRIPTION) FOR THIS CATEGORY (MAX 25 CHARS. ) 


USA < 2500 LB. 
a WANT ANOTHER CATEGORY (YES/NO) 


(РАТА |: 81<30 


5 
Bee sia ACC : 00)) 
SYMBOL (ANY LETTER ,NUMBER OR SPECIAL CHARACTER ) 


(DAT 
ENTER THE 
A 

ENTER THE COLOR (WHITE,GREEN,BLUE,TURQUOISE,RED,YELLOW OR PINK 
ENTER THE SIZE , ASA NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 

ENTER A LABEL (DESCRIPTION ) FOR THIS CATEGORY (MAX 25 CHARS.) 
2500 < USA < 3000 LB. 

DO ЕШ WANT ANOTHER CATEGORY (YES/NO) 

THIS CATEGORY 


R 
) ; АКРАЕДИ 653500 ) ) 
MBOL (ANY LETTER,NUMBER OR SPECIAL CHARACTER) 


ENTER THE COLOR (NHITE,GREEN,BLUE,TURQUOISE,RED,YELLOW OR PINK 
ENTER THE SIZE , ASA NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 

ENTER A LABEL (DESCRIPTION) FOR THIS CATEGORY (MAX 25 CHARS. ) 
3000 < USA < 3500 LB. 

Е WANT ANOTHER CATEGORY (YES/NO) 


uuuuuuuyyuyuuyguuuyuuuuuuyuuugyuyuyu 
RESSION M zr CATEGORY 
^ 2 


S 
ATAL: 13] 
A THE S 


Ur] 


ANTER THE COLOR (WHITE ,GREEN,BLUE,TURQUOISE,RED,YELLOW OR PINK 
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при THE SIZE , ASA NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 


ENTER A LABEL (DESCRIPTION ) FOR THIS CATEGORY (MAX 25 CHARS. ) 
USA » 3500 LB. 


p ши WANT ANOTHER CATEGORY (YES/NO) 


OO 
Е 
IA 
KO 


YHBOL (ANY LETTER ,NUMBER OR SPECIAL CHARACTER ) 

ENTER THE COLOR (WHITE ,GREEN , BLUE , TURQUOISE ,RED ,YELLOW OR PINK 
ENTER THE SIZE , AS A NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 

ENTER A LABEL (DESCRIPTION ) FOR THIS CATEGORY (MAX 25 CHARS. ) 


FOREIGN s 2500 LB. 
DH WANT ANOTHER CATEGORY (YES/NO) 


S 
1 (CD (DATAU:81€3000)) 
ENTER THE SYMBOL (ANY LETTER,NUMBER OR SPECIAL CHARACTER) 


E THE COLOR (WHITE,GREEN,BLUE,TURQUOISE,RED,YELLOW OR PINK 
Ed THE SIZE , ASA NUMBER BETWEEN 1 (SMALL ) AND 12 (BIC) 
ENTER À LABEL (DESCRIPTION) FOR THIS CATEGORY (MAX 25 CHARS.) 


2500 < ЕОВЕТ. < 3000 ГВ. 
22 ЕШ WANT ANOTHER CATEGORY (YES/NO) 


(ГАТАГ:81<3500)) 
ANY LETTER ,NUMBER OR SPECIAL CHARACTER ) 


реа THE COLOR (WHITE ,GREEN, BLUE ,TURQUOISE ,RED,YELLOW OR PINK 
БОЕВЕ THE SIZE , AS A NUMBER BETWEEN 1 (SMALL) AND 12 (BIG) 
ENTER A LABEL (DESCRIPTION) FOR THIS CATEGORY (MAX 25 CHARS. ) 


3000 < ЕОВЕТ. < 3500 LB. 
20 YOU WANT ANOTHER CATEGORY (YES/NO) 
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5. CODED DRAFTSMAN PLOTS 
This program, as mentioned in Chapter IHI is executed by typing 
DRAFTSMAN, and answering the queries as follows : 


DRAFTSMAN 

IS YOUR DATA SET LOCATED IN THIS WORKSPACE? 
(YES OR NO) 
YES 
ENTER THE NAME OF YOUR DATA SET 

| AUTOS 
DO YOU WANT ALL OF THIS DATA OR JUST A SUBSAMPLE OF IT TO 
BE PRESENTED IN THE DRAFTSMAN DISPLAY? ENTER (ALL OR SUB) 


DO YOU HAVE ATWO DIMENSIONAL ARRAY OF NAMES FORTHE DATA 
WHICH IS TO BE DISPLAYED? NOTE: THESE NAMES ARE THE NAMES 
OF THE VARIABLES REPRSENTED BY THE COLUMNS OF YOUR DATA SET. 
‘ela OR NO) 


WHAT IS THE NAME OF YOUR ARRAY OF VARIABLE NAMES? 


| AUTOSC 


HOW MANY VARIABLES DO YOU DESIRE JITTERED? 
TYPE 0 IF YOU WANT NONE 
| 0 
HOW MANY VARIABLES DO YOU WANT TO HAVE TRANSFORMED ? 
ТҮРЕ O TF YOU WANT NONE 
| 0 


DO YOU WANT TO DO WANT TO FIT A SMOOTHED CURVE 
a ALLDRAFTAMAN PLOTS? ...(YES OR NO) 


DEC WANT ASYMBOLIC DRAFTSMAN (YES/NO) 


ENTER AS A VECTOR THE VARIABLES (COLUMNS ) THAT YOU WHISH TO HAVE 
ДМ THE X AND Y AXIS (THE FIRST AND SECOND DIMENSION FOR THE PLOT ) 


шаъзо 


NEXT, YOU HAVE ТО ENTER APL EXPRESSION FOR EACH CATEGORY (CODE) 
USE ХХ AS THE NAME OF YOUR AR RRAY 
Mii l= LOO VAC KX L 2 oF 0) 
VARSE T AND J REPRESENT COLUMN. NUMBERS BETWEEN 1 AND 4 
BE CAREFULLY NOT TO OVERLAP VALUES 


WHEN THE PROGRAM ASK FOR SYMBOLS TYPE ANY (ONE) CHARACTER 
FOR COLORS TYPE THE NAME OF THE COLOR I.E. BLUE OR RED 
WITH SIZES 1 REPRESENT SMALL AND 12 BIG 

Г EXPRESSION FOR THE CATEGORY (CODE) NUMBER 1 
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NER THE SYMBOL 

ENTER THE COLOR 

RED 

ca THE SIZE 

vo iud WHISH ANOTHER CATEGORY (YES/NO) 
A Ee EXPRESSION FOR THE CATEGORY (CODE) NUMBER 2 
л = 

eee TRE SYMBOL 

ENTER THE COLOR 

RED 

eee THE SIZE 


ae YOU WHISH ANOTHER CATEGORY (YES/NO) 


YOU HAVE NOW 4 BASIC VARIABLES TO PLOT 
ENTER NUMBER OF PLOTS PER SCREEN (38055) 


H 


DO YOU WANT TO FIT A SMOOTHED CURVE 
on SELECTED PLOTS? ...(YES OR NO) 


(AT THIS POINT THE CODED DRAFTSMAN PLOT IS SHOWN ON THE SCREEN) 


ENTER TO UIT 

ENTER RASE AND CONTINUE 

ENTER C 70 COPY AND CONTINUE 

ENTER CE TO COPY, ERASE AND CONTINUE 
PRESS ENTER ONLY TO CONTINUE 
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APPENDIX C 
STAR PLOTS OF AUTOMOBILE DATA 


Bs в» тд! 
"NP Ww ID. 
S O QJ A 
За В 16) 12 
3 iii IB 
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